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METH1 and METH2 Polynucleotides and Polypeptides 
Background of the Invention 



Federally-Sponsored Research and Development 

Part of the work performed during development of this invention utilized 
US. Government funds. The U.S. Government has certain rights in this invention. 

Field of the Invention 

The present invention relates to novel anti-angiogenic proteins, related to 
thrombospondin. More specifically, isolated nucleic acid molecules are provided 
encoding human METH1 and METH2 (ME, for metalloprotease, and TH, for 
thrombospondin). METH1 and METH2 polypeptides are also provided, as are 
vectors, host cells and recombinant methods for producing the same. Also 
provided are diagnostic methods for the prognosis of cancer and therapeutic 
methods for treating individuals in need of an increased amount of METH1 or 
METH2. 



Related AH 



Angiogenesis, the formation of new blood vessels from pre-existing 
vasculature, is a tightly regulated process in normal adults. Under physiological 
circumstances, growth of new capillaries is tightly controlled by an interplay of 
growth regulatory proteins which act either to stimulate or to inhibit blood vessel 
growth. Normally, the balance between these forces is tipped in favor of inhibition 
and consequently blood vessel growth is restrained. Under certain pathological 
circumstances, however, local inhibitory controls are unable to restrain the. 
increased activity of angiogenic inducers. Angiogenesis is a key step in the 
metastasis of cancer (Folkman, Nature Med 7:27-31 (1995)) and in abnormal 
wound healing, inflammation, rheumatoid arthritis, psoriasis, and diabetic 
retinopathy, it is integral to the pathology (Folkman et al. y Science 235:442-447 
(1987)), engendering the hope that these pathological entities could be regulated 
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by pharmacological and/or genetic suppression of blood vessel growth (Iruela- 
Arispee/a/., Thromb. Haem. 75:672-677 1997)). 

Thrombospondin-1 (TSP-1) is a 450 kDa, anti-angiogenic adhesive 
glycoprotein released from activated platelets and secreted by growing cells 
(reviewed in Adams, Int J. Biochem. Cell Biol 29:861-865 (1997)). TSP-1 is a 
homotrimer, with each subunit comprised of a 1152 amino acid residue 
polypeptide, post-translationally modified by AMinked glycosylation and beta- 
hydroxylation of asparagine residues. 

TSP-1 protein and mRNA levels are regulated by a variety of factors. 
TSP-1 protein levels are downregulated by IL-1 alpha and TNF alpha. TSP-1 
mRNA and protein levels are upregulated by polypeptide growth factors including 
PDGF, TGF-beta, and bFGF (Bornstein, Faseb J. 6: 3290-3299 (1992)) and are 
also regulated by the level of expression of the p53 tumor suppressor gene product 
(Dameron et al., Science 265: 1 582- 1 584 (1 994)). At least four other members 
of the thrombospondin family have been identified: TSP-2, TSP-3, TSP-4, and 
TSP-5 (also called COMP). There is a need in the art to identify other molecules 
involved in the regulation of angiogenesis. 

Summary of the Invention 

The present invention provides isolated nucleic acid molecules comprising 
a polynucleotide encoding the METH1 polypeptide having the amino acid 
sequence shown in SEQ ID NO:2 or the amino acid sequence encoded by the 
cDNA clone deposited in a bacterial host as ATCC Deposit Number 209581 on 
January 15, 1998. 

The present invention also provides isolated nucleic acid molecules 
comprising a polynucleotide encoding the METH2 polypeptide having the amino 
acid sequence shown in SEQ ID NO:4 or the amino acid sequence encoded by the 
cDNA clone deposited in a bacterial host as ATCC Deposit Number 209582 on 
January 15, 1998. 
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The present invention also relates to recombinant vectors, which include 
the isolated nucleic acid molecules of the present invention, and to host cells 
containing the recombinant vectors, as well as to methods of making such vectors 
and host cells and for using them for production of METH1 or METH2 
polypeptides or peptides by recombinant techniques. 

The invention further provides an isolated METH1 or METH2 polypeptide 
having an amino acid sequence encoded by a polynucleotide described herein. 

The invention further provides a diagnostic method useful during diagnosis 
or prognosis of cancer. 

An additional aspect of the invention is related to a method for treating an 
individual in need of an increased level of METH1 or METH2 activity in the body 
comprising administering to such an individual a composition comprising a 
therapeutically effective amount of an isolated METH1 or METH2 polypeptide 
of the invention or an agonist thereof 

Brief Description of the Figures 

Figure 1 shows the nucleotide (SEQ ID NO:l) and deduced amino acid 
(SEQ ED NO:2) sequences of METH1. The protein has a predicted leader 
sequence of about 28 amino acid residues (underlined). 

Figure 2 shows the nucleotide (SEQ ID NO:3) and deduced amino acid 
(SEQ ID NO:4) sequences of METH2. The protein has a predicted leader 
sequence of about 23 amino acid residues (underlined). 

Figure 3 shows a comparison of the amino acid sequence ofMETHl (SEQ 
ID NO:2) and METH2 (SEQ ID NO:4) with that of their closest homologue, a 
bovine metalloprotease (pNPI) (SEQ ID NO:5). Identical amino acids are boxed. 
Functional domains predicted by sequence and structural homology are labeled, 
including the signal peptide (single line), the potential cleavage site for mammalian 
subtilisin (double underlined), the zinc-binding-site (dotted line) in the 
metalloprotease domain, and the putative disintegrin loops (arrows). 



Figure 4 shows the primary structure of METH1, METH2 and pNPI 
which includes a prodomain, a catalytic metalloprotease domain, a cysteine rich 
disintegrin domain, a TSP-like domain, a spacer region and a different number of 
TSP-like domains, three for METH1, two for METH2, and four for pNPI. 

Figure 5 shows a comparison of the TSP-like domain of METH1 (SEQ ID 
NO:2) and METH2 (SEQ ID NO:4) with those of TSP1 (SEQ ID NOs:6, 7, and 
8) and TSP2 (SEQ ID NOs:9, 10, and 11), cysteines are numbered 1 to .6, 
tryptophans are marked by asterisks. 

Figure 6 shows that peptides and recombinant protein derived from the 
TSP-like domain of METH1 and METH2 block VEGF-induced angiogenesis. 
Angiogenesis was induced on CAMs from 12-14-day-old embryos using a nylon 
mesh containing VEGF casted on matrigel and in the presence or absence of the 
peptides or recombinant protein. Capillary density was evaluated as described in 
Example 4. Positive and negative control included VEGF alone and vehicle alone, 
respectively. (A) Quantification of the angiogenic response induced by VEGF in 
the presence of recombinant proteins. TSP1, purified platelet TSP1, GST, 
purified GST, GST-TSP1, GST-METH1, and GST-METH2 are described in 
Example 4. (B) Quantification of the angiogenic response induced by VEGF in 
the presence or absence of the peptides; P-TSP1, P-METH1, and P-METH2 
(peptide derived from the Type I repeats of TSP, METH1 and METH2, 
respectively); SCI and SC2 are scramble peptides used as controls. (C) Dose- 
response of the VEGF-induced angiogenesis in the presence of GST-METH1 . 
(D) Dose-response of the VEGF-induced angiogenesis in the presence of GST- 
METH2. The angiogenic index was expressed considering the vascular response 
from the VEGF-matrigel as 1 00% and subtracting the background levels (matrigel 
alone). Assays were repeated, at least, twice. Each treatment was done in 
triplicate. Values represent the mean, bars indicate standard deviations. *p<0.001. 

Figure 7 shows the effect of METH1 and METH2 recombinant proteins 
on bFGF-stimulated cell proliferation. Cells were cultured on 24-well plates in 
media containing bFGF and the recombinant protein to be tested (3jig/ml, unless 
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indicated in the graph). Controls included vehicle or GST recombinant protein 
alone. (A), HDEC, human dermal endothelial cells; (B), HMEC, human mammary 
epithelial cells; (C), HDF, human dermal fibroblasts; (D), SMC, smooth muscle 
cells; (E) Dose-response of GST-METH1 and GST-METH2 on HDEC 
proliferation. Experiments were repeated, at least, twice. Each treatment was 
done in triplicate. Values represent the mean, bars indicate standard deviations. 
*p<0.01. 

Figure 8 shows a schematic representation of the pHE4-5 expression 
vector (SEQ ID NO: 12) and the subcloned METH1 or METH2 cDNA coding 
sequence. The locations of the kanamycin resistance marker gene, the METH1 
or METH2 coding sequence, the oriC sequence, and the laclq coding sequence 
are indicated. 

Figure 9 shows the nucleotide sequence of the regulatory elements of the 
pHE promoter (SEQ ID NO: 13). The two lac operator sequences, the 
Shine-Delgarno sequence (S/D), and the terminal HinctHl and Ndel restriction 
sites (italicized) are indicated. 

Figure 10 shows an analysis of the METH1 amino acid sequence. Alpha, 
beta, turn and coil regions; hydrophilicity and hydrophobicity; amphipathic 
regions; flexible regions; antigenic index and surface probability are shown, and 
all were generated using the default settings. In the "Antigenic Index or 
Jameson-Wolf 1 graph, the positive peaks indicate locations of the highly antigenic 
regions of the METH1 or METH2 protein, i.e., regions from which epitope- 
bearing peptides of the invention can be obtained. The domains defined by these 
graphs are contemplated by the present invention. Tabular representation of the 
data summarized graphically in Figure 10 can be found in Table 1 . 

Figure 1 1 shows an analysis of the METH2 amino acid sequence. Alpha, 
beta, turn and coil regions; hydrophilicity and hydrophobicity; amphipathic 
regions; flexible regions; antigenic index and surface probability are shown, and 
all were generated using the default settings. In the "Antigenic Index or 
Jameson-Wolf 1 graph, the positive peaks indicate locations of the highly antigenic 
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regions of the METH1 or METH2 protein, i.e., regions from which epitope- 
bearing peptides of the invention can be obtained. The domains defined by these 
graphs are contemplated by the present inventioa Tabular representation of the 
data summarized graphically in Figure 1 1 can be found in Table 2. 
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Detailed Description 

By screening cDNA libraries with cDNA encoding the anti-angiogenic 
domain of TSP-1, the present inventors have identified two novel proteins, 
METH1 and METH2 (also called VEGA-1 and VEGA-2, respectively, for 
vascular endothelial growth antagonist) which contain the anti-angiogenic domain 
of TSP- 1 , a metalloproteinase domain, and a disintegrin-like domain. The present 
inventors have demonstrated that both METH1 and METH2 have anti-angiogenic 
activity. 

Thus, the present invention provides isolated nucleic acid molecules 
comprising a polynucleotide encoding a METH1 polypeptide having the amino 
acid sequence shown in SEQ ID NO:2, which was determined by sequencing a 
cloned cDNA. The METH1 protein of the present invention shares sequence 
homology with thrombospondin-1 andpNPI. The nucleotide sequence shown in 
SEQ ID NO: 1 was obtained by sequencing a cDNA clone, which was deposited 
on January 15, 1998 at the American Type Culture Collection, 10801 University 
Boulevard, Manassas, Virginia 20 1 1 0-2209, and given accession number 20958 1 . 
The cDNA clone contained in ATCC Deposit No. 209581 contains a METH1 
sequence, encoding amino acids 1 to 950 of SEQ ID NO:2. 

The present invention also provides isolated nucleic acid molecules 
comprising a polynucleotide encoding a METH2 polypeptide having the amino 
acid sequence shown in SEQ ID NO:4, which was partially determined by 
sequencing a cloned cDNA. The METH2 protein of the present invention shares 
sequence homology with thrombospondin-1 and pNPL The nucleotide sequence 
shown in SEQ ID NO:3 was partially obtained by sequencing a cDNA clone, 
which was deposited on January 15, 1998 at the American Type Culture 
Collection, 10801 University Boulevard, Manassas, Virginia 20110-2209, and 
given accession number 209582. The cDNA clone contained in ATCC Deposit 
No. 209582 contains a partial METH2 sequence, encoding amino acids 1 12-890 
ofSEQIDNO:4. 
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Nucleic Acid Molecules 

Some of the nucleotide sequences determined by sequencing a DNA 
molecule herein were determined using an automated DNA sequencer (such as the 
Model 373 from Applied Biosystems, Inc.), and all amino acid sequences of 
polypeptides encoded by DNA molecules determined herein were predicted by 
translation of a DNA sequence determined as above. Therefore, as is known in 
the art for any DNA sequence determined by this automated approach, any 
nucleotide sequence determined herein may contain some errors. Nucleotide 
sequences determined by automation are typically at least about 90% identical, 
more typically at least about 95% to at least about 99.9% identical to the actual 
nucleotide sequence of the sequenced DNA molecule. The actual sequence can 
be more precisely determined by other approaches including manual DNA 
sequencing methods well known in the art. As is also known in the art, a single 
insertion or deletion in a determined nucleotide sequence compared to the actual 
sequence will cause a frame shift in translation of the nucleotide sequence such 
that the predicted amino acid sequence encoded by a determined nucleotide 
sequence will be completely different from the amino acid sequence actually 
encoded by the sequenced DNA molecule, beginning at the point of such an 
insertion or deletion. 

Using the information provided herein, such as the nucleotide sequence in 
SEQ ID NO: 1 or SEQ ID NO:3, a nucleic acid molecule of the present invention 
encoding a METH1 or METH2 polypeptide may be obtained using standard 
cloning and screening procedures, such as those for cloning cDNAs using mRNA 
as starting material. Illustrative of the invention, the nucleic acid molecule 
described in SEQ ID NO: 1 was discovered in a cDNA library derived from human 
heart and the nucleic acid molecule described in SEQ ED NO:3 was discovered in 
a cDNA library derived from human lung. The determined nucleotide sequence 
of the METH1 cDNA of SEQ ID NO: 1 contains an open reading frame encoding 
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a protein of about 950 amino acid residues, including a predicted leader sequence 
of about 28 amino acid residues. The present inventors have determined that the 
nucleotide sequence of the METH2 cDNA of SEQ ID NO: 3 contains an open 
reading frame encoding a protein of about 890 amino acid residues, including a 
predicted leader sequence of about 23 amino acid residues. 

The present invention also provides the mature form(s) of the METH 1 and 
METH2 proteins of the present invention. According to the signal hypothesis, 
proteins secreted by mammalian cells have a signal or secretory leader sequence 
which is cleaved from the mature protein once export of the growing protein chain 
across the rough endoplasmic reticulum has been initiated. Most mammalian cells 
and even insect cells cleave secreted proteins with the same specificity. However, 
in some cases, cleavage of a secreted protein is not entirely uniform, which results 
in two or more mature species on the protein. Further, it has long been known 
that the cleavage specificity of a secreted protein is ultimately determined by the 
primary structure of the complete protein, that is, it is inherent in the amino acid 
sequence of the polypeptide. Therefore, the present invention provides a 
nucleotide sequence encoding the mature METH1 polypeptide having the amino 
acid sequence encoded by the cDNA clone contained in the host identified as 
ATCC Deposit No. 209581 and as shown in SEQ ID NO:2. The present 
invention also provides a nucleotide sequence encoding the mature METH2 
polypeptide having the amino acid sequence as shown in SEQ ID NO:4. By the 
mature METH1 protein having the amino acid sequence encoded by the cDNA 
clone contained in the host identified as ATCC Deposit No. 20958 1 is meant the 
mature form(s) of the METH1 protein produced by expression in a mammalian 
cell (e.g., COS cells, as described below) of the complete open reading frame 
encoded by the human DNA sequence of the clone contained in the vector in the 
deposited host. As indicated below, the mature METH1 having the amino acid 
sequence encoded by the cDNA clone contained in ATCC Deposit No. 209581 
may or may not differ from the predicted "mature" METH 1 protein shown in SEQ 
ID NO:2 (amino acids from about 29 to about 950) depending on the accuracy of 
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the predicted cleavage site based on computer analysis; and the mature METH2 
may or may not differ from the predicted "mature" METH2 protein shown in SEQ 
ID NO: 4 (amino acids from about 24 to about 890) depending on the accuracy 
of the predicted cleavage site based on computer analysis. 

Methods for predicting whether a protein has a secretory leader as well as 
the cleavage point for that leader sequence are available. For instance, the 
methods of McGeoch (Virus Res. 3:271-286 (1985)) and von Heinje (Nucleic 
Acids Res. 74:4683-4690 (1986)) can be used. The accuracy of predicting the 
cleavage points ofknown mammalian secretory proteins for each of these methods 
is in the range of 75-80%. von Heinje, supra. However, the two methods do not 
always produce the same predicted cleavage point(s) for a given protein. 

In the present case, the predicted amino acid sequence of the complete 
- METH1 and METH2 polypeptides of the present invention were analyzed by a 
computer program ("PSORT") (K Nakai and M Kanehisa, Genomics 74:897-9 1 1 
(1992)), which is an expert system for predicting the cellular location of a protein 
based on the amino acid sequence. As part of this computational prediction of 
localization, the methods of McGeoch and von Heinje are incorporated. The 
analysis by the PSORT program predicted the cleavage site between amino acids 
28 and 29 in SEQ ID NO:2 and amino acids 23 and 24 in SEQ ID NO:4. 
Thereafter, the complete amino acid sequences were further analyzed by visual 
inspection, applying a simple form of the (-1 ,-3) rule of von Heinje. von Heinje, 
supra. Thus, the leader sequence for the METH1 protein is predicted to consist 
of amino acid residues from about 1 to about 28 in SEQ ID NO:2, while the 
mature METH1 protein is predicted to consist of residues from about 29 to about 
950; and the leader sequence for the METH2 protein is predicted to consist of 
amino acid residues from about 1 to about 23 in SEQ ID NO:4, while the mature 
METH2 protein is predicted to consist of residues from about 24 to about 890. 
An alternative predicted mature METH1 protein consists of residues 30 to 950 in 
SEQ ID NO:2. 
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As one of ordinary skill would appreciate, due to the possibilities of 
sequencing errors, as well as the variability of cleavage sites for leaders in different 
known proteins, the predicted METH1 polypeptide encoded by the deposited 
cDNA comprises about 950 amino acids, but may be anywhere in the range of 
910-990 amino acids; and the predicted leader sequence of this protein is about 
28 amino acids, but may be anywhere in the range of about 1 8 to about 38 amino 
acids. Also, the predicted METH2 polypeptide comprises about 890 amino acids, 
but may be anywhere in the range of 850 to about 930 amino acids; and the 
predicted leader sequence of this protein is about 23 amino acids, but may be 
anywhere in the range of about 13 to about 33 amino acids. 

As indicated, nucleic acid molecules of the present invention may be in the 
form of RNA, such as mRNA, or in the form of DNA, including, for instance, 
cDNA and genomic DNA obtained by cloning or produced synthetically. The 
DNA may be double-stranded or single-stranded. Single-stranded DNA or RNA 
may be the coding strand, also known as the sense strand, or it may be the 
non-coding strand, also referred to as the anti-sense strand. 

By "isolated" nucleic acid molecule(s) is intended a nucleic acid molecule, 
DNA or RNA, which has been removed from its native environment. For 
example, recombinant DNA molecules contained in a vector are considered 
isolated for the purposes of the present invention. Further examples of isolated 
DNA molecules include recombinant DNA molecules maintained in heterologous 
host cells or purified (partially or substantially) DNA molecules in solution. 
Isolated RNA molecules include in vivo or in vitro RNA transcripts of the DNA 
molecules of the present invention. Isolated nucleic acid molecules according to 
the present invention further include such molecules produced synthetically. 

Isolated nucleic acid molecules of the present invention include DNA 
molecules comprising an open reading frame (ORF) shown in SEQ ID NO:l; 
DNA molecules comprising the coding sequence for the mature METH1 protein; 
and DNA molecules which comprise a sequence substantially different from those 
described above but which, due to the degeneracy of the genetic code, still encode 
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the METH1 protein. Also included are DNA molecules comprising an open 
reading frame (ORF) shown in SEQ ID NO:3; DNA molecules comprising the 
coding sequence for the mature METH2 protein; and DNA molecules which 
comprise a sequence substantially different from those described above but which, 
due to the degeneracy of. the genetic code, still encode the METH2 protein. Of 
course, the genetic code is well known in the art. Thus, it would be routine for 
one skilled in the art to generate such degenerate variants. 

In another aspect, the invention provides isolated nucleic acid molecules 
encoding the METH1 or METH2 polypeptides having an amino acid sequence as 
encoded by the cDNA clones contained in the plasmids deposited as ATCC 
Deposit No. 209581 on January 15, 1998 or ATCC Deposit No. 209582 on 
January 1 5, 1998, respectively. In a further embodiment, nucleic acid molecules 
are provided encoding the mature METH1 or METH2 polypeptide or the full- 
length METH1 or METH2 polypeptide lacking the N-terminal methionine. The 
invention also provides an isolated nucleic acid molecule having the nucleotide 
sequence shown in SEQ ID NO: 1 or SEQ ID NO:3 or the nucleotide sequence of 
the METH 1 or METH2 cDN A contained in the above-described deposited clones, 
or a nucleic acid molecule having a sequence complementary to one of the above 
sequences. Such isolated molecules, particularly DNA molecules, are useful as 
probes for gene mapping, by in situ hybridization with chromosomes, and for 
detecting expression of the METH1 or METH2 gene in human tissue, for 
instance, by Northern blot analysis. 

The present invention is further directed to fragments of the isolated 
nucleic acid molecules described herein. By a fragment of an isolated nucleic acid 
molecule having the nucleotide sequence of the deposited cDNA or the nucleotide 
sequence shown in SEQ ID NO: 1 or SEQ ID NO:3 is intended fragments at least 
about 1 5 nt, and more preferably at least about 20 nt, still more preferably at least 
about 30 nt, and even more preferably, at least about 40 nt in length which are 
useful as diagnostic probes and primers as discussed herein. Of course, larger 
fragments 50, 100, 150, 200, 250, 300, 350, 400, 450, 500, 550, 600, 650, 700, 
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750, 800, 850, 900, 950, 1000, 1050, 1 100, 1200, 1300, 1400, 1500, 1600, 1700, 
1800, 1900, 2000, 2100, 2200, 2300, 2400, 2500, 2600, 2700, 2800, 2900, or 
3000 nt in length are also useful according to the present invention as are 
fragments corresponding to most, if not all, of the nucleotide sequence of the 
deposited cDNA or as shown in SEQ ID NO: 1 or SEQ ID NO:3 . By a fragment 
at least 20 nt in length, for example, is intended fragments which include 20 or 
more contiguous bases from the nucleotide sequence of the deposited cDNA or 
the nucleotide sequence as shown in SEQ ED NO: 1 or SEQ ID NO:3. 

Preferred nucleic acid fragments of the present invention include nucleic 
acid molecules encoding epitope-bearing portions of the METH1 or METH2 
protein. Methods for determining epitope-bearing portions of the METH1 and 
METH2 proteins are described in detail below. 

Other preferred nucleic acid fragments of the present invention include 
nucleic acid molecules encoding: the metalloprotease domain of METH1, amino 
acids 235 to 459 in SEQ ID NO:2; the disintegrin domain of METH1, amino 
acids 460 to 544 in SEQ ID NO:2; the first TSP-like domain of METH1, amino 
acids 545 to 598 in SEQ ID NO:2; the second TSP-like domain of METH1 , amino 
acids 841 to 894 in SEQ ID NO:2; the third TSP-like domain of METH1, amino 
acids 895 to 934 in SEQ ID NO:2; amino acids 536 to 613 in SEQ ID NO:2; 
amino acids 549 to 563 in SEQ ID NO:2; the metalloprotease domain of METH2, 
amino acids 214 to 439 in SEQ ID NO:4; the disintegrin domain of METH2, 
amino acids 440 to 529 in SEQ ID NO:4; the first TSP-like domain of METH2, 
amino acids 530 to 583 in SEQ ID NO:4; the second TSP-like domain of METH2, 
amino acids 837 to 890 in SEQ ID NO:4; amino acids 280 to 606 in SEQ ID 
NO:4; and amino acids 529 to 548 in SEQ ID NO:4. 

In addition, the present inventors have identified the following cDNA 
clones related to portions of the sequence shown in SEQ ID NO: 1 : HOUCQ 1 7RA 
(SEQ ID NO:14), HPLBM1 1R (SEQ ID NO:15), HGBI07R (SEQ ID NO:16), 
HNTMA49R (SEQ ID NO: 1 7), HNALE27R (SEQ ID NO: 1 8), and HIBDB45R 
(SEQ ID NO: 19). 
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The following public ESTs, which relate to portions of SEQ ID NO:l, 
have also been identified: D67076 (SEQ ID NO:20), AB001735 (SEQ ID 
NO:21), X14787 (SEQ ID NO:22), U64857 (SEQ ID NO:23), X04665 (SEQ ID 
NO:24), M64866 (SEQ IDNO:25), L07803 (SEQ ID NO:26), U08006 (SEQ ID 
NO:27), M16974 (SEQ ID NO:28), L13855 (SEQ ID NO:29), AL021529 (SEQ 
ID NO:30), D86074 (SEQ ID NO:3 1), L05390 (SEQ ID NO:32), Z6936I (SEQ 
ID NO:33), X99599 (SEQ ID NO:34), AF018073 (SEQ ID NO:35), L23760 
(SEQ ID NO:36), Z46970 (SEQ ID NO:37), AC004449 (SEQ ID NO:38), 
Z69589 (SEQ ID NO:39), Z22279 (SEQ ED NO:40), and X17524 (SEQ ID 
NO:41). 

The present inventors have also identified the following cDNA clones 
related to portions of SEQ ID NO:3: HCE4D69FP02 (SEQ ED NO:42), 
HE8DB45F (SEQ ED NO:43), HKIXH64R (SEQ DO NO:44), HEBDB45R (SEQ 
DO NO: 19), HCE3Z95R (SEQ ED NO:45), HTLEQ90R (SEQ ID NO.46), 
HMWEF45R (SEQ ED NO:47), HTOFC34RA (SEQ DO NO:48), HHFDI20R 
(SEQ DO NO:49), HMSHY47R (SEQ DO NO:50), HCESF90R (SEQ ED NO:5 1), 
HMCA046R (SEQ DO NO:52), HTTAQ67R (SEQ ED NO:53), HFKCF19F 
(SEQ ED NO:54), HMCAS31R (SEQ ID NO:55), HMWGP26R (SEQ ED 
NO:56), HLHTP36R (SEQ ID NO:57), HE8AN11R (SEQ ED NO:58), 
HEONN73R (SEQ ID NO:59), HBNBG53R (SEQ ED NO:60), and HMSCH94R 
(SEQ ID NO:61). 

The following public ESTs, which relate to portions of the sequence 
shown in SEQ ED NO:3, have also been identified: D67076 (SEQ ED NO:20), 
AB001735 (SEQ ED NO:21), AB005287 (SEQ ED NO:62), X87619 (SEQ ED 
NO:63), XI 4787 (SEQ ED NO:22), X04665 (SEQ ED NO:24), M87276 (SEQ ED 
NO:64), M62458 (SEQ ED NO:65), AB002364 (SEQ ED NO:66), AB005297 
(SEQ ED NO:67), X69161 (SEQ ED NO:68), X16619 (SEQ ED NO:69), 136448 
(SEQ ID NO:70), L12260 (SEQ DO NO:71), 136352 (SEQ DO NO:72), X15898 
(SEQ ID NO:73), 107789 (SEQ DO NO:74), 108144 (SEQ ED NO:75) U31814 
(SEQ DO NO:76), and AF001444 (SEQ ID NO:77). 
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In specific embodiments, the polynucleotides of the invention are less than 
300 kb, 200 kb, 100 kb, 50 kb, 15 kb, 10 kb, or 7.5 kb in length. In a further 
embodiment, polynucleotides of the invention comprise at least 15 contiguous 
nucleotides of METH1 or METH2 coding sequence, but do not comprise all or 
a portion of any METH1 or METH2 intron. In another embodiment, the nucleic 
acid comprising METH1 or METH2 coding sequence does not contain coding 
sequences of a genomic flanking gene (i.e., 5' or 3* to the METH1 or METH2 
gene in the genome). 

In another aspect, the invention provides an isolated nucleic acid molecule 
comprising a polynucleotide which hybridizes under stringent hybridization 
conditions to a portion of the polynucleotide in a nucleic acid molecule of the 
invention described above, for instance, the cDNA clones contained in ATCC 
Deposit No. 209581 or ATCC Deposit No. 209582. By "stringent hybridization 
conditions" is intended overnight incubation at 42 °C in a solution comprising: 
50% formamide, 5x SSC (750 mM NaCl, 75mM trisodium citrate), 50 mM 
sodium phosphate (pH 7.6), 5x Denhardt's solution, 10% dextran sulfate, and 
20 jig/ml denatured, sheared salmon sperm DNA, followed by washing the filters 
in0.1xSSCatabout65°C. 

By a polynucleotide which hybridizes to a "portion" of a polynucleotide 
is intended a polynucleotide (either DNA or RNA) hybridizing to at least about 
1 5 nucleotides (nt), and more preferably at least about 20 nt, still more preferably 
at least about 30 nt, and even more preferably about 30, 40, 50, 60 or 70 nt of the 
reference polynucleotide. These are useful as diagnostic probes and primers as 
discussed above and in more detail below. 

By a portion of a polynucleotide of "at least 20 nt in length," for example, 
is intended 20 or more contiguous nucleotides from the nucleotide sequence of the 
reference polynucleotide (e.g., the deposited cDNAs or the nucleotide sequence 
as shown in SEQ ID NO: 1 or SEQ ID NO:3). Of course, a polynucleotide which 
hybridizes only to a poly A sequence (such as the 3 ' terminal poly(A) tract of the 
METH1 or METH2 cDNA shown in SEQ ID NO:l and SEQ ID NO:3, 
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respectively) or to a complementary stretch of T (or U) resides, would not be 
included in a polynucleotide of the invention used to hybridize to a portion of a 
nucleic acid of the invention, since such a polynucleotide would hybridize to any 
nucleic acid molecule containing a poly (A) stretch or the complement thereof 
(e.g., practically any double-stranded cDNA done). 

Also contemplated are nucleic acid molecules that hybridize to the METH1 
or METH2 polynucleotides at moderately high stringency hybridization 
conditions. Changes in the stringency of hybridization and signal detection are 
primarily accomplished through the manipulation of formamide concentration 
(lower percentages of formamide result in lowered stringency); salt conditions, or 
temperature. For example, moderately high stringency conditions include an 
overnight incubation at 37 degree C in a solution comprising 6X SSPE (20X 
SSPE = 3M NaCl; 0.2M NaH^O^ 0.02M EDTA, pH 7.4), 0.5% SDS, 30% 
formamide, 100 |ig/ml salmon sperm blocking DNA; followed by washes at 50 °C 
with 1XSSPE, 0. 1% SDS. In addition, to achieve even lower stringency, washes 
performed following stringent hybridization can be done at higher salt 
concentrations (e.g. 5X SSC). 

Note that variations in the above conditions may be accomplished through 
the inclusion and/or substitution of alternate blocking reagents used to suppress 
background in hybridization experiments. Typical blocking reagents include 
Denhardt's reagent, BLOTTO, heparin, denatured salmon sperm DNA, and 
commercially available proprietary formulations. The inclusion of specific 
blocking reagents may require modification of the hybridization conditions 
described above, due to problems with compatibility. 

Of course, a polynucleotide which hybridizes only to polyA+ sequences 
(such as any 3Vterminal polyA+ tract of a cDNA shown in the sequence listing), 
or to a complementary stretch of T (or U) residues, would not be included in the 
definition of "polynucleotide," since such a polynucleotide would hybridize to any 
nucleic acid molecule containing a poly (A) stretch or the complement thereof 
(e.g., practically any double-stranded cDNA clone). 
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The METH1 or METH2 polynucleotide can be composed of any 
polyribonucleotide or polydeoxribonucleotide, which may be unmodified RNA or 
DNA or modified RNA or DNA. For example, METH1 or METH2 
polynucleotides can be composed of single- and double-stranded DNA, DNA that 
is a mixture of single- and double-stranded regions, single- and double-stranded 
RNA, and RNA that is mixture of single- and double-stranded regions, hybrid 
molecules comprising DNA and RNA that may be single-stranded or, more 
typically, double-stranded or a mixture of single- and double-stranded regions. In 
addition, the METH1 or METH2 polynucleotides can be composed of triple- 
stranded regions comprising RNA or DNA or both RNA and DNA. METH1 or 
METH2 polynucleotides may also contain one or more modified bases or DNA 
or RNA backbones modified for stability or for other reasons. "Modified" bases 
include, for example, tritylated bases and unusual bases such as inosine. A variety 
of modifications can be made to DNA and RNA; thus, "polynucleotide" embraces 
chemically, enzymatically, or metabolically modified forms. 

"SEQ ID NO: 1 " refers to a METH1 polynucleotide sequence while "SEQ 
ID NO:2" refers to a METH1 polypeptide sequence. "SEQ ID NO:3" refers to 
a METH2 polynucleotide sequence while "SEQ ID NO:4" refers to a METH2 
polypeptide sequence. 

As indicated, nucleic acid molecules of the present invention which encode 
a METH1 or METH2 polypeptide may include, but are not limited to, those 
encoding the amino acid sequence of the mature polypeptide, by itself, the coding 
sequence for the mature polypeptide and additional sequences, such as those 
encoding the leader or secretory sequence, such as a pre-, or pro- or prepro- 
protein sequence; the coding sequence of the mature polypeptide, with or without 
the aforementioned additional coding sequences, together with additional, 
non-coding sequences, including for example, but not limited to introns and 
non-coding 5 ' and 3 9 sequences, such as the transcribed, non-translated sequences 
that play a role in transcription, mRNA processing, including splicing and 
polyadenylation signals, for example - ribosome binding and stability of mRNA; 
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an additional coding sequence which codes for additional amino acids, such as 
those which provide additional functionalities. Thus, the sequence encoding the 
polypeptide may be fused to a marker sequence, such as a sequence encoding a 
peptide which facilitates purification of the fused polypeptide. In certain preferred 
embodiments of this aspect of the invention, the marker amino acid sequence is a 
hexa-histidine peptide, such as the tag provided in a pQE vector (Qiagen, Inc.), 
among others, many of which are commercially available. As described in Gentz 
etal t Proc. Natl Acad Sci. USA #6:821-824 (1989), for instance, hexa-histidine 
provides for convenient purification of the fusion protein. The "HA" tag is 
another peptide useful for purification which corresponds to an epitope derived 
from the influenza hemagglutinin protein, which has been described by Wilson et 
aL, Cell 37:161-11% (1984). As discussed below, other such fusion proteins 
include the METH1 or METH2 fused to Fc at the N- or C-terminus. 

The present invention further relates to variants of the nucleic acid 
molecules of the present invention, which encode portions, analogs or derivatives 
of the METH 1 or METH2 protein. Variants may occur naturally, such as a natural 
allelic variant. By an "allelic variant" is intended one of several alternate forms of 
a gene occupying a given locus on a chromosome of an organism. Lewin, B., ed., 
Genes II, John Wiley & Sons, New York (1985). Non-naturally occurring 
variants may be produced using art-known mutagenesis techniques. 

Such variants include those produced by nucleotide substitutions, deletions 
or additions, which may involve one or more nucleotides. The variants may be 
altered in coding regions, non-coding regions, or both. Alterations in the coding 
regions may produce conservative or non-conservative amino acid substitutions, 
deletions or additions. Especially preferred among these are silent substitutions, 
additions and deletions, which do not alter the properties and activities of the 
METH1 or METH2 protein or portions thereof Also especially preferred in this 
regard are conservative substitutions. 

Further embodiments of the invention include isolated nucleic acid 
molecules comprising a polynucleotide having a nucleotide sequence at least 95% 
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identical, and more preferably at leiast 96%, 97%, 98% or 99% identical to: a 
nucleotide sequence encoding the polypeptide having the amino acid sequence in 
SEQ ID NO:2; a nucleotide sequence encoding the polypeptide having the amino 
acid sequence in SEQ ID NO:2, but lacking the N-terminal methionine; a 
nucleotide sequence encoding the polypeptide having the amino acid sequence at 
positions from about 29 to about 950 in SEQ ID NO:2; a nucleotide sequence 
encoding the polypeptide having the amino acid sequence at position from about 
30 to about 950 in SEQ ID NO:2; a nucleotide sequence encoding the polypeptide 
having the amino acid sequence encoded by the cDNA clone contained in ATCC 
Deposit No. 209581; a nucleotide sequence encoding the mature METH1 
polypeptide having the amino acid sequence encoded by the cDNA clone 
contained in ATCC Deposit No. 20958 1 ; a nucleotide sequence encoding amino 
acids 235 to 459 in SEQ ID NO:2 (the metalloprotease domain of METH1); a 
nucleotide sequence encoding amino acids 460 to 544 in SEQ ID NO: 2 (the 
disintegrin domain of METH1); a nucleotide sequence encoding amino acids 545 
to 598 in SEQ ID NO:2 (the first TSP-like domain of METH1); a nucleotide 
sequence encoding amino acids 841 to 894 in SEQ ID NO:2 (the second TSP-like 
domain of METH1); a nucleotide sequence encoding amino acids 895 to 934 in 
SEQ ED NO:2 (the third TSP-like domain of METH1); a nucleotide sequence 
encoding amino acids 536 to 613 in SEQ ID NO:2; a nucleotide sequence 
encoding amino acids 549 to 563 in SEQ ID NO:2; a nucleotide sequence 
encoding the polypeptide having the amino acid sequence in SEQ ID NO:4; a 
nucleotide sequence encoding the polypeptide having the amino acid sequence in 
SEQ ID NO:4, but lacking the N-terminal methionine; a nucleotide sequence 
encoding the polypeptide having the amino acid sequence at positions from about 
24 to about 890 in SEQ ID NO:4; a nucleotide sequence encoding the polypeptide 
having the amino acid sequence at positions from about 1 12 to about 890 in SEQ 
ID NO:4; a nucleotide sequence encoding the polypeptide having the amino acid 
sequence encoded by the cDNA clone contained in ATCC Deposit No. 209582; 
a nucleotide sequence encoding the mature METH2 polypeptide having the amino 
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acid sequence encoded by the cDNA clone contained in ATCC Deposit No. 
209582; a nucleotide sequence encoding amino acids 21 4 to 439 in SEQ ID NO:4 
(the metalloprotease domain of METH2); a nucleotide sequence encoding amino 
acids 440 to 529 in SEQ ID NO:4 (the disintegrin domain of METH2); a 
5 nucleotide sequence encoding amino acids 530 to 583 in SEQ ID NO:4 ( the first 

TSP-like domain of METH2); a nucleotide sequence encoding amino acids 837 
to 890 in SEQ ID NO:4 (the second TSP-like domain of METH2); a nucleotide 
sequence encoding amino acids 280 to 606 in SEQ ID NO:4; a nucleotide 
sequence encoding amino acids 529 to 548 in SEQ ID NO:4; or a nucleotide 

10 sequence complementary to any of the above nucleotide sequences . 

By a polynucleotide having a nucleotide sequence at least, for example, 
95% "identical" to a reference nucleotide sequence encoding a METH1 or 
METH2 polypeptide is intended that the nucleotide sequence of the 
polynucleotide is identical to the reference sequence except that the polynucleotide 

15 sequence may include up to five point mutations per each 100 nucleotides of the 

reference nucleotide sequence encoding the METH1 or METH2 polypeptide. In 
other words, to obtain a polynucleotide having a nucleotide sequence at least 95% 
identical to a reference nucleotide sequence, up to 5% of the nucleotides in the 
reference sequence may be deleted or substituted with another nucleotide, or a 

20 number of nucleotides up to 5% of the total nucleotides in the reference sequence 

may be inserted into the reference sequence. These mutations of the reference 
sequence may occur at the 5 ' or 3 ' terminal positions of the reference nucleotide 
sequence or anywhere between those terminal positions, interspersed either 
individually among nucleotides in the reference sequence or in one or more 

25 contiguous groups within the reference sequence. 

As a practical matter, whether any particular nucleic acid molecule is at 
least 95%, 96%, 97%, 98% or 99% identical to, for instance, the nucleotide 
sequence shown in SEQ ED NO: 1 or SEQ ID NO: 3 or to the nucleotide sequence 
of the deposited cDNA clones can be determined conventionally using known 

30 computer programs such as the Bestfit program (Wisconsin Sequence Analysis 
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Package, Version 8 for Unix, Genetics Computer Group, University Research 
Park, 575 Science Drive, Madison, Wl 5371 1). Bestfit uses the local homology 
algorithm of Smith and Waterman, Advances in Applied Mathematics 2: 482-489 
(1981), to find the best segment of homology between two sequences. When 
using Bestfit or any other sequence alignment program to determine whether a 
particular sequence is, for instance, 95% identical to a reference sequence 
according to the present invention, the parameters are set, of course, such that the 
percentage of identity is calculated over the full length of the reference nucleotide 
sequence and that gaps in homology of up to 5% of the total number of 
nucleotides in the reference sequence are allowed. 

A preferred method for determining the best overall match between a 
query sequence (a sequence of the present invention) and a subject sequence, also 
referred to as a global sequence alignment, can be determined using the FASTDB 
computer program based on the algorithm of Brutlag et al y Comp. Appl. Bioscl 
6:237-245 (1 990). In a sequence alignment, the query and subject sequences are 
both DNA sequences. An RNA sequence can be compared by converting U's to 
Ts. The result of said global sequence alignment is in percent identity. Preferred 
parameters used in a FASTDB alignment of DNA sequences to calculate percent 
identity are: Matrix=Unitary, k-tuple=4, Mismatch Penalty^ 1 , Joining Penalty=3 0, 
Randomization Group Length=0, Cutoff Score=l, Gap Penalty=5, Gap Size 
Penalty=0.05, Window Size=500 or the length of the subject nucleotide sequence, 
whichever is shorter. 

If the subject sequence is shorter than the query sequence because of 5' or 
3' deletions, not because of internal deletions, a manual correction must be made 
to the results. This is because the FASTDB program does not account for 5* and 
3' truncations of the subject sequence when calculating percent identity. For 
subject sequences truncated at the 5 1 or 3' ends, relative to the query sequence, the 
percent identity is corrected by calculating the number of bases of the query 
sequence that are 5* and 3* of the subject sequence, which are not matched/aligned, 
as a percent of the total bases of the query sequence. Whether a nucleotide is 
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matched/aligned is determined by the results of the FASTDB sequence alignment. 
This percentage is then subtracted from the percent identity, calculated by the 
above FASTDB program using the specified parameters, to arrive at a final 
percent identity score. This corrected score is what is used for the purposes of the 
present invention. Only bases outside the 5' and 3 1 bases of the subject sequence, 
as displayed by the FASTDB alignment, which are not matched/aligned with the 
query sequence are calculated for the purposes of manually adjusting the percent 
identity score. 

For example, a 90 base subject sequence is aligned to a 100 base query 
sequence to determine percent identity. The deletions occur at the 5* end of the 
subject sequence and, therefore, the FASTDB alignment does not show a 
match/alignment of the first 10 bases at the 5' end. The 10 unpaired bases 
represent 10% of the sequence (number of bases at the 5' and 3* ends not 
matched/total number of bases in the query sequence), so 10% is subtracted from 
the percent identity score calculated by the FASTDB program. If the remaining 
90 bases were perfectly matched the final percent identity would be 90%. In 
another example, a 90 base subject sequence is compared with a 100 base query 
sequence. This time the deletions are internal, so that there are no bases on the 
5* or 3' ends of the subject sequence which are not matched/aligned with the query. 
In this case, the percent identity calculated by FASTDB is not manually corrected. 
One again, only bases 5' and 3* of the subject sequence which are not 
matched/aligned with the query sequence are manually corrected for. No other 
manual corrections are to be made for the purposes of the present invention. 

The present application is directed to nucleic acid molecules at least 95%, 
96%, 97%, 98% or 99% identical to the nucleic acid sequence shown in SEQ ID 
NO: 1 or SEQ ID NO:3 or to the nucleic acid sequence of the deposited cDNAs, 
irrespective of whether they encode a polypeptide having METH1 or METH2 
activity. This is because even where a particular nucleic acid molecule does not 
encode a polypeptide having METH1 or METH2 activity, one of skill in the art 
would still know how to use the nucleic acid molecule, for instance, as a 
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hybridization probe or a polymerase chain reaction (PCR) primer. Uses of the 
nucleic acid molecules of the present invention that do not encode a polypeptide 
having METH1 or METH2 activity include, inter alia, (1) isolating the METH1 
or METH2 gene or allelic variants thereof in a cDNA library; (2) in situ 
hybridization (e.g. , "FISH") to metaphase chromosomal spreads to provide precise 
chromosomal location of the METH1 or METH2 gene, as described in Verma et 
al , Human Chromosomes: A Manual of Basic Techniques, Pergamon Press, New 
York (1988); and (3) Northern Blot analysis for detecting METH1 or METH2 
mRNA expression in specific tissues. 

Preferred, however, are nucleic acid molecules having sequences at least 
95%, 96%, 97%, 98% or 99% identical to the nucleic acid sequence shown in 
SEQ ID NO: 1 or SEQ ID NO:3 or to a nucleic acid sequence of the deposited 
cDNAs which do, in fact, encode a polypeptide having METH1 or METH2 
protein activity. By "a polypeptide having METH1 activity" is intended 
polypeptides exhibiting METH1 activity in a particular biological assay. For 
example, METH1 protein activity can be measured using the chorioallantoic 
membrane assay (Iruela-Arispe et al, Thrombosis and Haemostasis 78(1):672- 
677 (1 997)) or the cornea pocket assay (Tolsma et al t J. Cell. Biol 122:497-5 1 1 
(1993)), both described in Example 4, below. By "a polypeptide having METH2 
activity" is intended polypeptides exhibiting METH2 activity in a particular 
biological assay. For example, METH2 protein activity can also be measured 
using the chorioallantoic membrane assay (Iruela-Arispe et al., Thrombosis and 
Haemostasis 78(1) :672-677 (1 997)) or the cornea pocket assay (Tolsma et al , J. 
Cell Biol 722:497-51 1 (1993)), both described in Example 4, below. 

Briefly, in the chorioallantoic assay, the potentially anti-angiogenic 
compound of interest is added to type I collagen pellets (Vitrogen), along with an 
angiogenic growth factor, such as bFGF. The samples are mixed and placed onto 
nylon meshes, and allowed to polymerize. After polymerization is complete, the 
meshes are placed onto the chorioallantoic membrane of 1 2 day old chick embryos 
and placed at 37°C for 24 hours. The embryos then injected with a fluorescent 
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agent, such as FITC-dextran, and the meshes are fixed and mounted for 
observation under a fluorescent microscope. 

In the cornea pocket assay, hydron pellets containing the compound of 
interest and an angiogenic growth factor, such as bFGF, are implanted 1 to 2mm 
from the limbus of the cornea of rats or mice. Response is examined after a period 
of time, for example 5 days. The extent of angiogenesis is evaluated by measuring 
the capillaries migrating from the limb of the cornea. 

Of course, due to the degeneracy of the genetic code, one of ordinary skill 
in the art will immediately recognize that a large number of the nucleic acid 
molecules having a sequence at least 95%, 96%, 97%, 98%, or 99% identical to 
a nucleic acid sequence of the deposited cDNAs or a nucleic acid sequence shown 
in SEQ ID NO: 1 or SEQ ID NO:3 will encode a polypeptide "having METH1 or 
METH2 protein activity." In fact, since degenerate variants of these nucleotide 
sequences all encode the same polypeptide, this will be clear to the skilled artisan 
even without performing the above described comparison assay. It will be further 
recognized in the art that, for such nucleic acid molecules that are not degenerate 
variants, a reasonable number will also encode a polypeptide having METH1 or 
METH2 protein activity. This is because the skilled artisan is fully aware of amino 
acid substitutions that are either less likely or not likely to significantly effect 
protein function (e.g., replacing one aliphatic amino acid with a second aliphatic 
amino acid). 

For example, guidance concerning how to make phenotypically silent 
amino acid substitutions is provided in Bowie, J. U. et aL, "Deciphering the 
Message in Protein Sequences: Tolerance to Amino Acid Substitutions," Science 
24 7: 1 3 06- 1 3 1 0 ( 1 990), wherein the authors indicate that proteins are surprisingly 
tolerant of amino acid substitutions. 
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Vectors and Host Cells 

The present invention also relates to vectors which include the isolated 
DNA molecules of the present invention, host cells which are genetically 
engineered with the recombinant vectors, and the production of METH1 or 
METH2 polypeptides or fragments thereof by recombinant techniques. 

The polynucleotides may be joined to a vector containing a selectable 
marker for propagation in a host. Generally, a plasmid vector is introduced in a 
precipitate, such as a calcium phosphate precipitate, or in a complex with a 
charged lipid. If the vector is a virus, it may be packaged in vitro using an 
appropriate packaging cell line and then transduced into host cells. 

The DNA insert should be operatively linked to an appropriate promoter, 
such as the phage lambda PL promoter, the E. coli lac, trp and tac promoters, the 
S V40 early and late promoters and promoters of retroviral LTRs, to name a few. 
Other suitable promoters will be known to the skilled artisan. The expression 
constructs will further contain sites for transcription initiation, termination and, in 
the transcribed region, a ribosome binding site for translation. The coding portion 
of the mature transcripts expressed by the constructs will preferably include a 
translation initiating at the beginning and a termination codon (UAA, UGA or 
UAG) appropriately positioned at the end of the polypeptide to be translated. 

As indicated, the expression vectors will preferably include at least one 
selectable marker. Such markers include dihydrofolate reductase or neomycin 
resistance for eukaryotic cell culture and tetracycline or ampicillin resistance genes 
for culturing in E. coli and other bacteria. Representative examples of appropriate 
hosts include, but are not limited to, bacterial cells, such as E. coli, Streptomyces 
and Salmonella typhimurium cells; fungal cells, such as yeast cells; insect cells 
such as Drosophila S2 and Spodoptera Sf9 cells; animal cells such as CHO, COS 
and Bowes melanoma cells; and plant cells. Appropriate culture mediums and 
conditions for the above-described host cells are known in the art. 
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Among vectors preferred for use in bacteria include pQE70, pQE60 and 
pQE-9, available from Qiagen; pBS vectors, Phagescript vectors, Bluescript 
vectors, pNH8A, pNH16a, pNH18A, pNH46A, available from Stratagene; and 
ptrc99a, pKK223-3, pKK233-3, pDR540, pRIT5 available from Pharmacia. 
Among preferred eukaryotic vectors are pWLNEO, pSV2CAT, pOG44, pXTl 
and pSG available from Stratagene; and pSVK3, pBPV, pMSG and pSVL 
available from Pharmacia. Other suitable vectors will be readily apparent to the 
skilled artisan. 

In addition to the use of expression vectors in the practice of the present 
invention, the present invention farther includes novel expression vectors 
comprising operator and promoter elements operatively linked to nucleotide 
sequences encoding a protein of interest. One example of such a vector is pHE4-5 
which is described in detail below. 

As summarized in Figures 8 and 9, components of the pHE4-5 vector 
(SEQ ID NO: 12) include: 1) a neomycinphosphotransferase gene as a selection 
marker, 2) an E. coli origin of replication, 3) a T5 phage promoter sequence, 
4) two lac operator sequences, 5) a Shine-Delgarno sequence, 6) the lactose 
operon repressor gene (laclq). The origin of replication (oriC) is derived from 
pUC19 (LTI, Gaithersburg, MD). The promoter sequence and operator 
sequences were made synthetically. Synthetic production of nucleic acid 
sequences is well known in the art. CLONTECH 95/96 Catalog, pages 215-21 6, 
CLONTECH, 1020 East Meadow Circle, Palo Alto, CA 94303. A nucleotide 
sequence encoding METH1 (SEQ ED NO:2) or METH2 (SEQ ED NO:4), is 
operatively linked to the promoter and operator by inserting the nucleotide 
sequence between the Ndel and Asp718 sites of the pHE4-5 vector. 

As noted above, the pHE4-5 vector contains a laclq gene. Laclq is an 
allele of the loci gene which confers tight regulation of the lac operator. Amann, 
E. et al % Gene 69:301-315 (1988); Stark, M, Gene 57:255-267 (1987). The 
laclq gene encodes a repressor protein which binds to lac operator sequences and 
blocks transcription of down-stream (/. e., 3') sequences. However, the laclq gene 
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product dissociates from the lac operator in the presence of either lactose or 
certain lactose analogs, e.g., isopropyl B-D-thiogalactopyrandside (IPTG). 
METH1 or METH2 thus is not produced in appreciable quantities in uninduced 
host cells containing the pHE4-5 vector. Induction of these host cells by the 
addition of an agent such as IPTG, however, results in the expression of the 
METH1 or METH2 coding sequence. 

The promoter/operator sequences of the pHE4-5 vector (SEQ ID NO : 1 3) 
comprise a T5 phage promoter and two lac operator sequences. One operator is 
located 5* to the transcriptional start site and the other is located 3 1 to the same 
site. These operators, when present in combination with the laclq gene product, 
confer tight repression of down-stream sequences in the absence of a lac operon 
inducer, e.g., IPTG. Expression of operatively linked sequences located down- 
stream from the lac operators may be induced by the addition of a lac operon 
inducer, such as IPTG. Binding of a lac inducer to the laclq proteins results in 
their release from the lac operator sequences and the initiation of transcription of 
operatively linked sequences. Lac operon regulation of gene expression is 
reviewed in Devlin, T., TEXTBOOK OF BIOCHEMISTRY WITH CLINICAL 
Correlations, 4th Edition (1997), pages 802-807. 

The pHE4 series of vectors contain all of the components of the pHE4-5 
vector except for the METH1 or METH2 coding sequence. Features of the pHE4 
vectors include optimized synthetic T5 phage promoter, lac operator, and 
Shine-Delgarno sequences. Further, these sequences are also optimally spaced so 
that expression of an inserted gene may be tightly regulated and high level of 
expression occurs upon induction. 

Among known bacterial promoters suitable for use in the production of 
proteins of the present invention include the E. coli lad and lacZ promoters, the 
T3 and T7 promoters, thegpt promoter, the lambda PR and PL promoters and the 
trp promoter. Suitable eukaryotic promoters include the CMV immediate early 
promoter, the HSV thymidine kinase promoter, the early and late SV40 
promoters, the promoters of retroviral LTRs, such as those of the Rous Sarcoma 
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Virus (RS V), and metallothionein promoters, such as the mouse metallothionein-I 
promoter. 

The pHE4-5 vector also contains a Shine-Delgarno sequence 5' to the 
AUG initiation codon. Shine-Delgarno sequences are short sequences generally 
located about 10 nucleotides up-stream (£«., 5') from the AUG initiation codon. 
These sequences essentially direct prokaryotic ribosomes to the AUG initiation 
codon. 

Thus, the present invention is also directed to expression vector useful for 
the production of the proteins of the present invention. This aspect of the 
invention is exemplified by the pHE4-5 vector (SEQ ID NO: 12). 

Introduction of the construct into the host cell can be effected by calcium 
phosphate transfection, DEAE-dextran mediated transfection, cationic 
lipid-mediated transfection, electroporation, transduction, infection or other 
methods. Such methods are described in many standard laboratory manuals, such 
as Davis et al. t Basic Methods In Molecular Biology (1 986). 

The polypeptide may be expressed in a modified form, such as a fusion 
protein, and may include not only secretion signals, but also additional 
heterologous functional regions. For instance, a region of additional amino acids, 
particularly charged amino acids, may be added to the N-terminus of the 
polypeptide to improve stability and persistence in the host cell, during 
purification, or during subsequent handling and storage. Also, peptide moieties 
may be added to the polypeptide to facilitate purification. Such regions may be 
removed prior to final preparation of the polypeptide. The addition of peptide 
moieties to polypeptides to engender secretion or excretion, to improve stability 
and to facilitate purification, among others, are familiar and routine techniques in 
the art. A preferred fusion protein comprises a heterologous region from 
immunoglobulin that is useful to solubilize proteins. For example, EP-A-O 464 
533 (Canadian counterpart 2045869) discloses fusion proteins comprising various 
portions of constant region of immunoglobin molecules together with another 
human protein or part thereof In many cases, the Fc part in a fusion protein is 
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thoroughly advantageous for use in therapy and diagnosis and thus results, for 
example, in improved pharmacokinetic properties (EP-A 0232 262). On the other 
hand, for some uses it would be desirable to be able to delete the Fc part after the 
fusion protein has been expressed, detected and purified in the advantageous 
manner described. This is the case when the Fc portion proves to be a hindrance 
to use in therapy and diagnosis, for example when the fusion protein is to be used 
as an antigen for immunizations. In drug discovery, for example, human proteins, 
such as the hIL5-receptor, have been fused with Fc portions for the purpose of 
high-throughput screening assays to identify antagonists of hIL-5. See, D. 
Bennett et a/., J. Mol Recognition 5:52-58 (1995) and K. Johanson e/ al y J. of 
Biol Chem. 270(7^:9459-9471 (1995). 

The METH1 or METH2 protein can be recovered and purified from 
recombinant cell cultures by well-known methods including ammonium sulfate or 
ethanol precipitation, acid extraction, anion or cation exchange chromatography, 
phosphocellulose chromatography, hydrophobic interaction chromatography, 
affinity chromatography, hydroxylapatite chromatography and lectin 
chromatography. Most preferably, high performance liquid chromatography 
("HPLC") is employed for purification. Polypeptides of the present invention 
include naturally purified products, products of chemical synthetic procedures, and 
products produced by recombinant techniques from a prokaryotic or eukaryotic 
host, including, for example, bacterial, yeast, higher plant, insect and mammalian 
cells. Depending upon the host employed in a recombinant production procedure, 
the polypeptides of the present invention may be glycosylated or may be 
non-glycosylated. In addition, polypeptides of the invention may also include an 
initial modified methionine residue, in some cases as a result of host-mediated 
processes. 
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METH1 and METH2 Polypeptides and Fragments 

The invention further provides an isolated METH 1 polypeptide having the 
amino acid sequence encoded by the deposited cDNA, or the amino acid sequence 
in SEQ ID NO:2, or a peptide or polypeptide comprising a portion of the above 
polypeptides. The invention also provides an isolated METH2 polypeptide having 
the amino acid sequence encoded by the deposited cDNA, or the amino acid 
sequence in SEQ ID NO:4, or a peptide or polypeptide comprising a portion of 
the above polypeptides. 

METH1 or METH2 polypeptides can be composed of amino acids joined 
to each other by peptide bonds or modified peptide bonds, i.e., peptide isosteres, 
and may contain amino acids other than the 20 gene-encoded amino acids. The 
METH1 or METH2 polypeptides may be modified by either natural processes, 
such as posttranslational processing, or by chemical modification techniques which 
are well known in the art. Such modifications are well described in basic texts and 
in more detailed monographs, as well as in a voluminous research literature. 
Modifications can occur anywhere in the METH1 or METH2 polypeptide, 
including the peptide backbone, the amino acid side-chains and the amino or 
carboxyl termini. It will be appreciated that the same type of modification may be 
present in the same or varying degrees at several sites in a given METH1 or 
METH2 polypeptide. Also, a given METH1 or METH2 polypeptide may contain 
many types of modifications. METH 1 or METH2 polypeptides may be branched 
, for example, as a result of ubiquitination, and they may be cyclic, with or without 
branching. Cyclic, branched, and branched cyclic METH1 or METH2 
polypeptides may result from posttranslation natural processes or may be made by 
synthetic methods. Modifications include acetylation, acylation, ADP- 
ribosylation, amidation, covalent attachment of flavin, covalent attachment of a 
heme moiety, covalent attachment of a nucleotide or nucleotide derivative, 
covalent attachment of a lipid or lipid derivative, covalent attachment of 
phosphotidylinositol, cross-linking, cyclization, disulfide bond formation, 
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demethylation, formation of covalent cross-links, formation of cysteine, formation 
of pyroghitamate, formylation, gamma-carboxylation, glycosylation, GPI anchor 
formation, hydroxylation, iodination, methylation, myristoylation, oxidation, 
pegylation, proteolytic processing, phosphorylation, prenylation, racemization, 
selenoylation, sulfation, transfer-RNA mediated addition of amino acids to 
proteins such as arginylation, and ubiquitination. (See, for instance, PROTEINS - 
STRUCTURE AND MOLECULAR PROPERTIES, 2nd Ed., T. E. Creighton, 
W. H. Freeman and Company, New York (1993); POSTTRANSLATIONAL 
COVALENT MODIFICATION OF PROTEINS, B. C. Johnson, Ed , Academic 
Press, New York, pgs. 1-12 (1983); Seifter et al y Meth Ertzymol 752:626-646 
(1990); Rattan et al, Ann NYAcadSci 665:48-62 (1992).) 

It will be recognized in the art that some amino acid sequences of the 
METH1 and METH2 polypeptides can be varied without significant effect of the 
structure or function of the protein. If such differences in sequence are 
contemplated, it should be remembered that there will be critical areas on the 
protein which determine activity. 

The present inventors have shown that METH1 and METH2 inhibit 
angiogenesis in vitro and in vivo. METH1 and METH2 each contain a 
metalloprotease domain, a disintegrin domain, and TSP-like domains. The 
metalloprotease domain may be catalytically active. The disintegrin domain may 
play a role in inhibiting angiogenesis by interacting with integrins, since integrins 
are essential for the mediation of both proliferative and migratory signals. The 
present inventors have shown that peptides derived from the TSP-like domains of 
METH1 and METH2 inhibit angiogenesis in vitro and in vivo. 

Thus, the invention further includes variations of the METH 1 polypeptide 
which show substantial METH1 polypeptide activity or which include regions of 
METH1 protein such as the protein portions discussed below; and variations of 
the METH2 polypeptide which show substantial METH2 polypeptide activity or 
which include regions of METH2 protein such as the protein portions discussed 
below. Such mutants include deletions, insertions, inversions, repeats, and type 
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substitutions. As indicated above, guidance concerning which amino acid changes 
are likely to be phenotypically silent can be found in Bowie, J.U., et aL 9 
"Deciphering the Message in Protein Sequences: Tolerance to Amino Acid 
Substitutions," Science 247:1306-1310 (1990). 

Thus, the fragment, derivative or analog of the polypeptide of SEQ ID 
NO:2 or SEQ ID NO:4, or that encoded by the deposited cDNA, may be (i) one 
in which one or more of the amino acid residues are substituted with a conserved 
or non-conserved amino acid residue (preferably a conserved amino acid residue) 
and such substituted amino acid residue may or may not be one encoded by the 
genetic code, or (ii) one in which one or more of the amino acid residues includes 
a substituent group, or (iii) one in which the mature polypeptide is fused with 
another compound, such as a compound to increase the half-life of the polypeptide 
(for example, polyethylene glycol), or (iv) one in which the additional amino acids 
are fused to the mature polypeptide, such as an IgG Fc fusion region peptide or 
leader or secretory sequence or a sequence which is employed for purification of 
the mature polypeptide or a proprotein sequence. Such fragments, derivatives and 
analogs are deemed to be within the scope of those skilled in the art from the 
teachings herein. 

Of particular interest are substitutions of charged amino acids with another 
charged amino acid and with neutral or negatively charged amino acids. The latter 
results in proteins with reduced positive charge to improve the characteristics of 
the METH1 or METH2 proteins. The prevention of aggregation is highly 
desirable. Aggregation of proteins not only results in a loss of activity but can also 
be problematic when preparing pharmaceutical formulations, because they can be 
immunogenic. (Pinckarde/a/., Clin. Exp. Immunol. 2:33 1-340 (1967); Robbins 
et al t Diabetes 35:838-845 (1987); Cleland et al, Crit Rev. Therapeutic Drug 
Carrier Systems 70:307-377 (1993)). 

As indicated, changes are preferably of a minor nature, such as 
conservative amino acid substitutions that do not significantly affect the folding 
or activity of the protein (see Table 3). 
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TABLE 3. Conservative Amino Acid Substitutions. 



Aromatic 


Phenylalanine 




Tryptophan 




Tyrosine 


Hydrophobic 


Leucine 




Isoleutine 




Valine 


Polar 


Glutamine 




Acnarapinp 


Basic 


Argrnine 




Lysine 




Histidine 


Acidic 


Aspartic Acid 




Glutamic Acid 


Small 


Alanine 




Serine 




Threonine 




Methionine 




Glycine 



Of course, the number of amino acid substitutions a skilled artisan would 
make depends on many factors, including those described above. Generally 
speaking, the number of amino acid substitutions for any given METH1 or 
METH2 polypeptide will not be more than 50, 40, 30, 20, 10, 5, or 3. 

Amino acids in the METH1 and METH2 proteins of the present invention 
that are essential for function can be identified by methods known in the art, such 
as site-directed mutagenesis or alanine-scanning mutagenesis (Cunningham and 
Wells, Science 244:1081-1085 (1989)). The latter procedure introduces single 
alanine mutations at every residue in the molecule. The resulting mutant 
molecules are then tested for biological activity such as in vitro or in vivo 
inhibition of angjogenesis. Sites that are critical for inhibition of angiogenesis can 
also be determined by structural analysis such as crystallization, nuclear magnetic 
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resonance or photoaffinity labeling (Smith etal. f J. Mol Biol 224\ 899-904 ( 1 992) 
and de Vos et al t Science 255:306-312 (1992)). 

The polypeptides of the present invention are preferably provided in an 
isolated form. By "isolated polypeptide" is intended a polypeptide removed from 
5 its native environment. Thus, a polypeptide produced and/or contained within a 

recombinant host cell is considered isolated for purposes of the present invention. 
Also intended as an "isolated polypeptide" are polypeptides that have been 
purified, partially or substantially, from a recombinant host cell or from a native 
source. For example, a recombinantly produced version of the METH1 or 

10 METH2 polypeptide can be substantially purified by the one-step method 

described in Smith and Johnson, Gene 67:31-40 (1988). 

The polypeptides of the present invention include the METH 1 polypeptide 
encoded by the deposited cDNA including the leader; the mature METH1 
polypeptide encoded by the deposited the cDNA minus the leader (i.e., the mature 

15 protein); a polypeptide comprising amino acids about 1 to about 950 in SEQ ID 

NO:2; a polypeptide comprising amino acids about 2 to about 950 in SEQ ID 
NO:2; a polypeptide comprising amino acids about 29 to about 950 in SEQ ID 
NO:2; a polypeptide comprising amino acids about 30 to about 950 in SEQ ID 
NO:2; a polypeptide comprising the metalloprotease domain of METH 1, amino 

20 acids 235 to 459 in SEQ ID NO:2; a polypeptide comprising the disintegrin 

domain of METH1, amino adds 460 to 544 in SEQ ID NO:2; a polypeptide 
comprising the first TSP-like domain of METH 1 , amino acids 545 to 598 in SEQ 
ID NO:2; a polypeptide comprising the second TSP-like domain of METH1, 
amino acids 841 to 894 in SEQ ID NO:2; a polypeptide comprising the third TSP- 

25 like domain of METH1 , amino acids 895 to 934 in SEQ ID NO:2; a polypeptide 

comprising amino acids 536 to 613 in SEQ ID NO:2; a polypeptide comprising 
amino acids 549 to 563 in SEQ ID NO:2; the METH2 polypeptide encoded by the 
deposited cDNA including the leader; the mature METH2 polypeptide encoded 
by the deposited the cDNA minus the leader (i.e., the mature protein); a 

30 polypeptide comprising amino acids about 1 to about 890 in SEQ ID NO:4; a 
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polypeptide comprising amino acids about 2 to about 890 in SEQ ID NO:4; a 
polypeptide comprising amino acids about 24 to about 890 in SEQ ID NO:4; a 
polypeptide comprising amino acids about 1 12 to about 890 in SEQ ID NO:4; a 
polypeptide comprising the metalloprotease domain of METH2, amino acids 214 
to 439 in SEQ ID NO:4; a polypeptide comprising the disintegrin domain of 
METH2, amino acids 440 to 529 in SEQ ID NO:4; a polypeptide comprising the 
first TSP-Iike domain of METH2, amino acids 530 to 583 in SEQ ID NO:4; a 
polypeptide comprising the second TSP-like domain of METH2, amino acids 837 
to 890 in SEQ ID NO:4; a polypeptide comprising amino acids 280 to 606 in SEQ 
ID NO:4; a polypeptide comprising amino acids 529 to 548 in SEQ ID NO:4; as 
well as polypeptides which are at least 95% identical, and more preferably at least 
96%, 97%, 98% or 99% identical to the polypeptides described above and also 
include portions of such polypeptides with at least 30 amino acids and more 
preferably at least 50 amino acids. 

By a polypeptide having an amino acid sequence at least, for example, 95% 
M identical M to a reference amino acid sequence of a METH1 or METH2 
polypeptide is intended that the amino acid sequence of the polypeptide is identical 
to the reference sequence except that the polypeptide sequence may include up to 
five amino acid alterations per each 100 amino acids of the reference amino acid 
of the METH1 or METH2 polypeptide. In other words, to obtain a polypeptide 
having an amino acid sequence at least 95% identical to a reference amino acid 
sequence, up to 5% of the amino acid residues in the reference sequence may be 
deleted or substituted with another amino acid, or a number of amino acids up to 
5% of the total amino acid residues in the reference sequence may be inserted into 
the reference sequence. These alterations of the reference sequence may occur at 
the amino or carboxy terminal positions of the reference amino acid sequence or 
anywhere between those terminal positions, interspersed either individually among 
residues in the reference sequence or in one or more contiguous groups within the 
reference sequence. 
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As a practical matter, whether any particular polypeptide is at least 95%, 
96%, 97%, 98% or 99% identical to, for instance, the amino acid sequence shown 
in SEQ ID NO:2 or SEQ ID NO:4 or to the amino acid sequence encoded by 
deposited cDN A clones can be determined conventionally using known computer 
programs such the Bestfit program (Wisconsin Sequence Analysis Package, 
Version 8 for Unix, Genetics Computer Group, University Research Park, 575 
Science Drive, Madison, WI 5371 1). When using Bestfit or any other sequence 
alignment program to determine whether a particular sequence is, for instance, 
95% identical to a reference sequence according to the present invention, the 
parameters are set, of course, such that the percentage of identity is calculated 
over the full length of the reference amino acid sequence and that gaps in 
homology of up to 5% of the total number of amino acid residues in the reference 
sequence are allowed. 

A preferred method for determining the best overall match between a 
query sequence (a sequence of the present invention) and a subject sequence, also 
referred to as a global sequence alignment, can be determined using the FASTDB 
computer program based on the algorithm of Brutlag et aL, Comp. App. Biosci. 
6:237-245 (1990). In a sequence alignment, the query and subject sequences are 
either both nucleotide sequences or both amino acid sequences. The result of said 
global sequence alignment is in percent identity. Preferred parameters used in a 
FASTDB amino acid alignment are: Matrix=PAM 0, k-tuple=2, Mismatch 
Penalty^ 1 , Joining Penal ty=20, Randomization Group Lengtfr=0, Cutoff Score= 1 , 
Gap Penalty=5, Gap Size Penalty=0.05, Window Size=500 or the length of the 
subject amino acid sequence, whichever is shorter. 

If the subject sequence is shorter than the query sequence due to N- or C- 
terminal deletions, not because of internal deletions, a manual correction must be 
made to the results. This is because the FASTDB program does not account for 
N- and C-terminal truncations of the subject sequence when calculating global 
percent identity. For subject sequences truncated at the N- and C-termini, relative 
to the query sequence, the percent identity is corrected by calculating the number 
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of residues of the query sequence that are N- and C -terminal of the subject 
sequence, which are not matched/aligned with a corresponding subject residue, as 
a percent of the total residues of the query sequence. Whether a residue is 
matched/aligned is determined by the results of the FASTDB sequence alignment. 
This percentage is then subtracted from the percent identity, calculated by the 
above FASTDB program using the specified parameters, to arrive at a final 
percent identity score. This final percent identity score is what is used for the 
purposes of the present invention. Only residues to the N- and C-termini of the 
subject sequence, which are not matched/aligned with the query sequence, are 
considered for the purposes of manually adjusting the percent identity score. That 
is, only query residue positions outside the farthest N- and C-terminal residues of 
the subject sequence. 

For example, a 90 amino acid residue subject sequence is aligned with a 
1 00 residue query sequence to determine percent identity. The deletion occurs at 
the N-terminus of the subject sequence and therefore, the FASTDB alignment 
does not show a match/alignment of the first 10 residues at the N-terminus. The 
1 0 unpaired residues represent 1 0% of the sequence (number of residues at the N- 
and C-termini not matched/total number of residues in the query sequence) so 
10% is subtracted from the percent identity score calculated by the FASTDB 
program. If the remaining 90 residues were perfecdy matched, the final percent 
identity would be 90%. In another example, a 90 residue subject sequence is 
compared with a 1 00 residue query sequence. This time, the deletions are internal, 
so there are no residues at the N- or C-termini of the subject sequence which are 
not matched/aligned with the query. In this case, the percent identity calculated 
by FASTDB is not manually corrected. Once again, only residue positions outside 
the N- and C-terminal ends of the subject sequence, as displayed in the FASTDB 
alignment, which are not matched/aligned with the query sequence are manually 
corrected for. No other manual corrections are made for the purposes of the 
present invention. 
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The polypeptides of the present invention are useful as a molecular weight 
marker on SDS-PAGE gels or on molecular sieve gel filtration columns using 
methods well known to those of skill in the art. 

In another aspect, the invention provides a peptide or polypeptide 
comprising an epitope-bearing portion of a polypeptide of the invention. The 
epitope of this polypeptide portion is an immunogenic or antigenic epitope of a 
polypeptide described herein. An "immunogenic epitope" is defined as a part of 
a protein that elicits an antibody response when the whole protein is the 
immunogen. On the other hand, a region of a protein molecule to which an 
antibody can bind is defined as an "antigenic epitope." The number of 
immunogenic epitopes of a protein generally is less than the number of antigenic 
epitopes. See, for instance, Geysen et al t Proa Naif, Acad ScL USA #7:3998- 
4002(1983). 

As to the selection of peptides or polypeptides bearing an antigenic epitope 
(i.e., that contain a region of a protein molecule to which an antibody can bind), 
it is well known in that art that relatively short synthetic peptides that mimic part 
of a protein sequence are routinely capable of eliciting an antiserum that reacts 
with the partially mimicked protein. See, for instance, Sutcliffe, J. G. et al 7 
"Antibodies that react with predetermined sites on proteins", Science 219: 660-666 
(1983). Peptides capable of eliciting protein-reactive sera are frequently 
represented in the primary sequence of a protein, can be characterized by a set of 
simple chemical rules, and are confined neither to immunodominant regions of 
intact proteins (i.e., immunogenic epitopes) nor to the amino or carboxyl 
terminals. 

Antigenic epitope-bearing peptides and polypeptides of the invention are 
therefore useful to raise antibodies, including monoclonal antibodies, that bind 
specifically to a polypeptide of the invention. See, for instance, Wilson et aL, Cell 
37:767-778 (1984) at 777. 

Antigenic epitope-bearing peptides and polypeptides of the invention 
preferably contain a sequence of at least seven, more preferably at least nine and 
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most preferably between about at least about 15 to about 30 amino acids 
contained within the amino acid sequence of a polypeptide of the invention. 

The epitope-bearing peptides and polypeptides of the invention may be 
produced by any conventional means. Houghten, R. A., "General method for the 
rapid solid-phase synthesis of large numbers of peptides: specificity of 
antigen-antibody interaction at the level of individual amino acids", Proc. Natl. 
Acad ScL USA 52:5131-5135 (1985). This "Simultaneous Multiple Peptide 
Synthesis (SMPS)" process is further described in U.S. Patent No. 4,63 1,21 1 to 
Houghten et al (1986). 

As one of skill in the art will appreciate, METH1 or METH2 polypeptides 
of the present invention and the epitope-bearing fragments thereof described 
above can be combined with parts of the constant domain of immunoglobulins 
(IgG), resulting in chimeric polypeptides. These fusion proteins facilitate 
purification and show an increased half-life in vivo. This has been shown, e.g., for 
chimeric proteins consisting of the first two domains of the human 
CD4-polypeptide and various domains of the constant regions of the heavy or light 
chains of mammalian immunoglobulins (EPA 394,827; Traunecker et al., Nature 
331:84- 86 (1988)). Fusion proteins that have a disulfide-linked dimeric structure 
due to the IgG part can also be more efficient in binding and neutralizing other 
molecules than the monomelic METH1 or METH2 protein or protein fragment 
alone (Fountoulakis etal. J. Biochem. 270:3958-3964 (1995)). 

METH1 and METH2 Polynucleotide and Polypeptide Fragments 

In the present invention, a "polynucleotide fragment" refers to a short 
polynucleotide having a nucleic acid sequence contained in the deposited clones 
or shown in SEQ ID NO: 1 or SEQ ID NO:3. The short nucleotide fragments are 
preferably at least about 1 5 nt, and more preferably at least about 20 nt, still more 
preferably at least about 30 nt, and even more preferably, at least about 40 nt in 
length. A fragment "at least 20 nt in length," for example, is intended to include 
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20 or more contiguous bases from the cDN A sequence contained in the deposited 
clones or the nucleotide sequence shown in SEQ ID NO:l or SEQ ID NO:3. 
These nucleotide fragments are useful as diagnostic probes and primers as 
discussed herein. Of course, larger fragments (e.g., 50, 150, 500, 600, 2000 
nucleotides) are preferred. 

Moreover, representative examples ofMETHl orMETH2 polynucleotide 
fragments include, for example, fragments having a sequence from about 
nucleotide number 1-50, 51-100, 101-150, 151-200, 201-250, 25 1-300, 301-350, 
351-400,401-450,451-500,501-550,551-600,651-700,701-750,751-800, 800- 
850, 851-900, 901-950, 951-1000, 1001-1050, 1051-1100, 1101-1150, 1151- 
1200, 1201-1250, 1251-1300, 1301-1350, 1351-1400, 1401-1450, 1451-1500, 
1501-1550, 1551-1600, 1601-1650, 1651-1700, 1701-1750, 1751-1800, 1801- 
1850, 1851-1900, 1901-1950, 1951-2000, or 2001 to the end of SEQ ID NO:l 
or SEQ ID NO: 3 or the cDNA contained in the deposited clones. In this context 
"about" includes the particularly recited ranges, larger or smaller by several (5, 4, 
3, 2, or 1) nucleotides, at either terminus or at both termini. Preferably, these 
fragments encode a polypeptide which has biological activity. More preferably, 
these polynucleotides can be used as probes or primers as discussed herein. 

In the present invention, a "polypeptide fragment" refers to a short amino 
acid sequence contained in SEQ ID NO:2 or SEQ ED NO:4 or encoded by the 
cDNA contained in the deposited clones. Protein fragments may be "free- 
standing," or comprised within a larger polypeptide of which the fragment forms 
a part or region, most preferably as a single continuous region. Representative 
examples of polypeptide fragments of the invention, include, for example, 
fragments from about amino acid number 1 -20, 2 1 -40, 4 1 -60, 6 1 -80, 8 1 - 1 00, 1 02- 
120, 121-140, 141-160, 161-180, 181-200,201-220,221-240, 241-260, 261-280, 
or 281 to the end of the coding region or SEQ ID NO:2 or SEQ ID NO:4. 
Moreover, polypeptide fragments can be about 20, 30, 40, 50, 60, 70, 80, 90, 1 00, 
1 1 0, 120, 1 30, 140, or 1 50 amino acids in length. In this context "about" includes 



WO 99/37660 



PCTAJS99/01313 



-66- 

the particularly recited ranges, larger or smaller by several (5, 4, 3, 2, or 1 ) amino 
acids, at either extreme or at both extremes. 

Preferred polypeptide fragments include the secreted METH 1 or METH2 
protein as well as the mature form. Further preferred polypeptide fragments 
5 include the secreted METH1 or METH2 protein or the mature form having a 

continuous series of deleted residues from the amino or the carboxy terminus, or 
both. For example, any number of amino acids, ranging from 1 -60, can be deleted 
from the amino terminus of either the secreted METH1 or METH2 polypeptide 
or the mature form. Similarly, any number of amino acids, ranging from 1 -30, can 

10 be deleted from the carboxy terminus of the secreted METH1 or METH2 protein 

or mature form. Furthermore, any combination of the above amino and carboxy 
terminus deletions are preferred. Similarly, polynucleotide fragments encoding 
these METH1 or METH2 polypeptide fragments are also preferred. 

Particularly, N-terminal deletions of the METH1 polypeptide can be 

15 described by the general formula m-950, where m is an integer from 2 to 949, 

where m corresponds to the position of the amino acid residue identified in SEQ 
ID NO:2. Preferably, N-terminal deletions of the METH1 polypeptide of the 
invention shown as SEQ ID NO:2 include polypeptides comprising the amino acid 
sequence of residues: G-2 to S-950; N-3 to S-950; A-4 to S-950; E-5 to S-950; 

20 R-6 to S-950; A-7 to S-950; P-8 to S-950; G-9 to S-950; S-10 to S-950; R-l 1 to 

S-950; S-12 to S-950; F-13 to S-950; G-14 to S-950; P-15 to S-950; V-16 to S- 
950; P-17 to S-950; T-l 8 to S-950; L-19 to S-950; L-20 to S-950; L-21 to S-950; 
L-22 to S-950; A-23 to S-950; A-24 to S-950; A-25 to S-950; L-26 to S-950; L- 
27 to S-950; A-28 to S-950; V-29 to S-950; S-30 to S-950; D-3 1 to S-950; A-32 

25 to S-950; L-33 to S-950; G-34 to S-950; R-35 to S-950; P-36 to S-950; S-37 to 

S-950; E-38 to S-950; E-39 to S-950; D-40 to S-950; E-41 to S-950; E-42 to S- 
950; L-43 to S-950; V-44 to S-950; V-45 to S-950; P-46 to S-950; E-47 to S- 
950; L~48 to S-950; E-49 to S-950; R-50 to S-950; A-51 to S-950; P-52 to S- 
950; G-53 to S-950; H-54 to S-950; G-55 to S-950; T-56 to S-950; T-57 to S- 

30 950; R-58 to S-950; L-59 to S-950; R-60 to S-950; L-61 to S-950; H-62 to S- 
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950; A-63 to S-950; F-64 to S-950; D-65 to S-950; Q-66 to S-950; Q-67 to S- 
950; L-68 to S-950; D-69 to S-950; L-70 to S-950; E-71 to S-950; L-72 to S- 
950; R-73 to S-950; P-74 to S-950; D-75 to S-950; S-76 to S-950; S-77 to S-950; 
F-78 to S-950; L-79 to S-950; A-80 to S-950; P-81 to S-950; G-82 to S-950; F- 
83 to S-950; T-84 to S-950; L-85 to S-950; Q-86 to S-950; N-87 to S-950; V-88 
to S-950; G-89 to S-950; R-90 to S-950; K-91 to S-950; S-92 to S-950; G-93 to 
S-950; S-94 to S-950; E-95 to S-950; T-96 to S-950; P-97 to S-950; L-98 to S- 
950; P-99 to S-950; E-100 to S-950; T-101 to S-950; D-102 to S-950; L-103 to 
S-950; A-104 to S-950; H-105 to S-950; C-106 to S-950; F-l 07 to S-950; Y-108 
to S-950; S-l 09 to S-950; G-l 10 to S-950; T-l 1 1 to S-950; V-ll 2 to S-950; N- 
1 13 to S-950; G-l 14 to S-950; D-l 15 to S-950; P-l 16 to S-950; S-l 17 to S-950; 
S-l 18 to S-950; A-l 19 to S-950; A-120 to S-950; A-121 to S-950; L-122 to S- 
950; S-123 to S-950; L-124 to S-950; C-125 to S-950; E-126 to S-950; G-127 
to S-950; V-128 to S-950; R-129 to S-950; G-130to S-950; A-131 to S-950; F- 
132 to S-950; Y-133 to S-950; L-134 to S-950; L-135 to S-950; G-136 to S-950; 
E-137 to S-950; A-138 to S-950; Y-139 to S-950; F-l 40 to S-950; 1-141 to S- 
950; Q-142 to S-950; P-143 to S-950; L-144 to S-950; P-145 to S-950; A-146 
to S-950; A-147 to S-950; S-148 to S-950; E-149 to S-950; R-150 to S-950; L- 
151 to S-950; A-l 52 to S-950; T-l 53 to S-950; A-l 54 to S-950; A-l 55 to S-950; 
P-156 to S-950; G-157 to S-950; E-158 to S-950; K-159 to S-950; P-160 to S- 
950; P-161 to S-950; A-162 to S-950; P-163 to S-950; L-164 to S-950; Q-165 
to S-950; F-166 to S-950; H-167 to S-950; L-168 to S-950; L-169 to S-950; R- 
170 to S-950; R-171 to S-950; N-172 to S-950; R-173 to S-950; Q-174 to S-950; 
G-175 to S-950; D-176 to S-950; V-177 to S-950; G-178 to S-950; G-179 to S- 
950; T-180 to S-950; C-181 to S-950; G-182 to S-950; V-183 to S-950; V-184 
to S-950; D-l 85 to S-950; D-l 86 to S-950; E-187 to S-950; P-l 88 to S-950; R- 
189 to S-950; P-190to S-950; T-191 to S-950; G-192 to S-950; K-193 to S-950; 
A-194 to S-950; E-195 to S-950; T-196 to S-950; E-197 to S-950; D-198 to S- 
950; E-199 to S-950; D-200 to S-950; E-201 to S-950; G-202 to S-950; T-203 
to S-950; E-204 to S-950; G-205 to S-950; E-206 to S-950; D-207 to S-950; E- 
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208 to S-950; G-209 to S-950; P-210 to S-950; Q-21 1 to S-950; W-212 toS-950; 
S-213 to S-950; P-214 to S-950; Q-215 to S-950; D-216 to S-950; P-217 to S- 
950; A-218 to S-950; L-219 to S-950; Q-220 to S-950; G-221 to S-950; V-222 
to S-950; G-223 to S-950; Q-224 to S-950; P-225 to S-950; T-226 to S-950; G- 
227 to S-950; T-228 to S-950; G-229 to S-950; S-230 to S-950; 1-23 1 to S-950; 
R-232 to S-950; K-233 to S-950; K-234 to S-950; R-235 to S-950; F-236 to S- 
950; V-237 to S-950; S-238 to S-950; S-239 to S-950; H-240 to S-950; R-241 
to S-950; Y-242 to S-950; V-243 to S-950; E-244 to S-950; T-245 to S-950; M- 
246 to S-950; L-247 to S-950; V-248 to S-950; A-249 to S-950; D-250 to S-950; 
Q-251 to S-950; S-252 to S-950; M-253 to S-950; A-254 to S-950; E-255 to S- 
950; F-256 to S-950; H-257 to S-950; G-258 to S-950; S-259 to S-950; G-260 
to S-950; L-261 to S-950; K-262 to S-950; H-263 to S-950; Y-264 to S-950; L- 
265 to S-950; L-266 to S-950; T-267 to S-950; L-268 to S-950; F-269 to S-950; 
S-270 to S-950; V-271 to S-950; A-272 to S-950; A-273 to S-950; R-274 to S- 
950; L-275 to S-950; Y-276 to S-950; K-277 to S-950; H-278 to S-950; P-279 
to S-950; S-280 to S-950; 1-281 to S-950; R-282 to S-950; N-283 to S-950; S- 
284 to S-950; V-285 to S-950; S-286 to S-950; L-287 to S-950; V-288 to S-950; 
V-289 to S-950; V-290 to S-950; K-291 to S-950; 1-292 to S-950; L-293 to S- 
950; V-294 to S-950; 1-295 to S-950; H-296 to S-950; D-297 to S-950; E-298 
to S-950; Q-299 to S-950; K-300 to S-950; G-301 to S-950; P-302 to S-950; E- 
303 to S-950; V-304 to S-950; T-305 to S-950; S-306 to S-950; N-307 to S-950; 
A-308 to S-950; A-309 to S-950; L-310 to S-950; T-31 1 to S-950; L-312 to S- 
950; R-313 to S-950; N-314 to S-950; F-315 to S-950; C-316 to S-950; N-317 
to S-950; W-3 1 8 to S-950; Q-3 19 to S-950; K-320 to S-950; Q-321 to S-950; H- 
322 to S-950; N-323 to S-950; P-324 to S-950; P-325 to S-950; S-326 to S-950; 
D-327 to S-950; R-328 to S-950; D-329 to S-950; A-330 to S-950; E-33 1 to S- 
950; H-332 to S-950; Y-333 to S-950; D-334 to S-950; T-335 to S-950; A-336 
to S-950; 1-337 to S-950; L-338 to S-950; F-339 to S-950; T-340 to S-950; R- 
341 to S-950; Q-342 to S-950; D-343 to S-950; L-344 to S-950; C-345 to S-950; 
G-346 to S-950; S-347 to S-950; Q-348 to S-950; T-349 to S-950; C-350 to S- 
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950; D-351 to S-950; T-352 to S-950; L-353 to S-950; G-354 to S-950; M-355 
to S-950; A-356 to S-950; D-357 to S-950; V-358 to S-950; G-359 to S-950; T- 
360 to S-950; V-36 1 to S-950; C-362 to S-950; D-363 to S-950; P-364 to S-950; 
S-365 to S-950; R-366 to S-950; S-367 to S-950; C-368 to S-950; S-369 to S- 
5 950; V-370 to S-950; 1-371 to S-950; E-372 to S-950; D-373 to S-950; D-374 

to S-950; G-375 to S-950; L-376 to S-950; Q-377 to S-950; A-378 to S-950; A- 
379 to S-950; F-380 to S-950; T-381 to S-950; T-382 to S-950; A-383 to S-950; 
H-384 to S-950; E-385 to S-950; L-386 to S-950; G-387 to S-950; H-388 to S- 
950; V-389 to S-950; F-390 to S-950; N-391 to S-950; M-392 to S-950; P-393 
10 to S-950; H-394 to S-950; D-395 to S-950; D-396 to S-950; A-397 to S-950; K- 

398 to S-950; Q-399 to S-950; C-400 to S-950; A-401 to S-950; S-402 to S-950; 
L-403 to S-950; N-404 to S-950; G-405 to S-950; V-406 to S-950; N-407 to S- 
950; Q-408 to S-950; D-409 to S-950; S-410 to S-950; H-41 1 to S-950; M-412 
to S-950; M-413 to S-950; A-414 to S-950; S-415 to S-950; M-41 6 to S-950; L- 
15 417 to S-950; S-41 8 to S-950; N-419 to S-950; L-420 to S-950; D-421 to S-950; 

H-422 to S-950; S-423 to S-950; Q-424 to S-950; P-425 to S-950; W-426 to S- 
950; S-427 to S-950; P-428 to S-950; C-429 to S-950; S-430 to S-950; A-43 1 to 
S-950; Y-432 to S-950; M-433 to S-950; 1-434 to S-950; T-435 to S-950; S-436 
to S-950; F-437 to S-950; L-438 to S-950; D-439 to S-950; N-440 to S-950; G- 
20 441 to S-950; H-442 to S-950; G-443 to S-950; E-444 to S-950; C-445 to S-950; 

L-446 to S-950; M-447 to S-950; D-448 to S-950;.K-449 to S-950; P-450 to S- 
950; Q-451 to S-950; N-452 to S-950; P-453 to S-950; 1-454 to S-950; Q-455 to 
S-950; L-456 to S-950; P-457 to S-950; G-458 to S-950; D-459 to S-950; L-460 
to S-950; P-161 to S-950; G-462 to S-950; T-463 to S-950; S-464 to S-950; Y- 
25 465 to S-950; D-466 to S-950; A-467 to S-950; N-468 to S-950; R-469 to S-950; 

Q-470 to S-950; C-471 to S-950; Q-472 to S-950; F-473 to S-950; T-474 to S- 
950; F-475 to S-950; G-476 to S-950; E-477 to S-950; D-478 to S-950; S-479 
to S-950; K-480 to S-950; H-48 1 to S-950; C-482 to S-950; P-483 to S-950; D- 
484 to S-950; A-485 to S-950; A-486 to S-950; S-487 to S-950; T-488 to S-950; 
30 C-489 to S-950; S-490 to S-950; T-491 to S-950; L-492 to S-950; W-493 to S- 
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950; C-494 to S-950; T-495 to S-950; G-496 to S-950; T-497 to S-950; S-498 
to S-950; G-499 to S-950; G-500 to S-950; V-501 to S-950; L-502 to S-950; V- 
503 to S-950; C-504 to S-950; Q-505 to S-950; T-506 to S-950; K-507 to S-950; 
H-508 to S^950; F-509 to S-950; P-510 to S-950; W-51 1 to S-950; A-512 to S- 
5 950; D-513 to S-950; G-514 to S-950; T-515 to S-950; S-516 to S-950; C-517 

to S-950; G-518 to S-950; E-519 to S-950; G-520 to S-950; K-521 to S-950; W- 
522 to S-950; C-523 to S-950; 1-524 to S-950; N-525 to S-950; G-526 to S-950; 
K-527 to S-950; C-528 to S-950; V-529 to S-950; N-530 to S-950; K-531 to S- 
950; T-532 to S-950; D-533 to S-950; R-534 to S-950; K-535 to S-950; H-536 
10 to S-950; F-537 to S-950; D-538 to S-950; T-539 to S-950; P-540 to S-950; F- 

541 to S-950; H-542to S-950; G-543 to S-950; S-544 to S-950; W-545 to S-950; 
G-546 to S-950; M-547 to S-950; W-548 to S-950; G-549 to S-950; P-550 to S- 
950; W-551 to S-950; G-552 to S-950; D-553 to S-950; C-554 to S-950; S-555 
to S-950; R-556 to S-950; T-557 to S-950; C-558 to S-950; G-559 to S-950; G- 
15 560 to S-950; G-561 to S-950; V-562 to S-950; Q-563 to S-950; Y-564 to S-950; 

T-565 to S-950; M-566 to S-950; R-567 to S-950; E-568 to S-950; C-569 to S- 
950; D-570 to S-950; N-571 to S-950; P-572 to S-950; V-573 to S-950; P-574 
to S-950; K-575 to S-950; N-576 to S-950; G-577 to S-950; G-578.to S-950; K- 
579 to S-950; Y-580 to S-950; C-581 to S-950; E-582 to S-950; G-583 to S-950; 
20 K-584 to S-950; R-585 to S-950; V-586 to S-950; R-587 to S-950; Y-588 to S- 

950; R-589 to S-950; S-590 to S-950; C-591 to S-950; N-592 to S-950; L-593 
to S-950; E-594 to S-950; D-595 to S-950; C-596 to S-950; P-597 to S-950; D- 
598 to S-950; N-599 to S-950; N-600 to S-950; G-601 to S-950; K-602 to S-950; 
T-603 to S-950; F-604 to S-950; R-605 to S-950; E-606 to S-950; E-607 to S- 
25 950; Q-608 to S-950; C-609 to S-950; E-610 to S-950; A-61 1 to S-950; H-612 

to S-950; N-613 to S-950; E-614 to S-950; F-61 5 to S-950; S-616 to S-950; K- 
617 to S-950; A-618 to S-950; S-619 to S-950; F-620 to S-950; G-621 to S-950; 
S-622 to S-950; G-623 to S-950; P-624 to S-950; A-625 to S-950; V-626 to S- 
950; E-627 to S-950; W-628 to S-950; 1-629 to S-950; P-630 to S-950; K-631 
30 to S-950; Y-632 to S-950; A-633 to S-950; G-634 to S-950; V-635 to S-950; S- 
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636 to S-950; P-637 to S-950; K-638 to S-950; D-639 to S-950; R-640 to S-950; 
C-641 to S-950; K-642 to S-950; L-643 to S-950; 1-644 to S-950; C-645 to S- 
950; Q-646 to S-950; A-647 to S-950; K-648 to S-950; G-649 to S-950; 1-650 
to S-950; G-651 to S-950; Y-652 to S-950; F-653 to S-950; F-654 to S-950; V- 
655 to S-950; L-656 to S-950; Q-657 to S-950; P-658 to S-950; K-659 to S-950; 
V-660 to S-950; V-661 to S-950; D-662 to S-950; G-663 to S-950; T-664 to S- 
950; P-665 to S-950; C-666 to S-950; S-667 to S-950; P-668 to S-950; D-669 to 
S-950; S-670 to S-950; T-671 to S-950; S-672 to S-950; V-673 to S-950; C-674 
to S-950; V-675 to S-950; Q-676 to S-950; G-677 to S-950; Q-678 to S-950; C- 
679 to S-950; V-680 to S-950; K-68 1 to S-950; A-682 to S-950; G-683 to S-950; 
C-684 to S-950; D-685 to S-950; R-686 to S-950; 1-687 to S-950; 1-688 to S- 
950; D-689 to S-950; S-690 to S-950; K-691 to S-950; K-692 to S-950; K-693 
to S-950; F-694 to S-950; D-695 to S-950; K-696 to S-950; C-697 to S-950; G- 
698 to S-950; V-699 to S-950; C-700 to S-950; G-701 to S-950; G-702 to S-950; 
N-703 to S-950; G-704 to S-950; S-705 to S-950; T-706 to S-950; C-707 to S- 
950; K-708 to S-950; K-709 to S-950; 1-710 to S-950; S-71 1 to S-950; G-712 to 
S-950; S-713 to S-950; V-714 to S-950; T-71 5 to S-950; S-71 6 to S-950; A-717 
to S-950; K-718 to S-950; P-719 to S-950; G-720 to S-950; Y-721 to S-950; H- 
722 to S-950; D-723 to S-950; 1-724 to S-950; 1-725 to S-950; T-726 to S-950; 
20 1-727 to S-950; P-728 to S-950; T-729 to S-950; G-730 to S-950; A-73 1 to S- 

950; T-732 to S-950; N-733 to S-950; 1-734 to S-950; E-735 to S-950; V-736 to 
S-950; K-737 to S-950; Q-738 to S-950;R-739to S-950; N-740 to S-950; Q-741 
to S-950; R-742 to S-950; G-743 to S-950; S-744 to S-950; R-745 to S-950; N- 
746 to S-950; N-747 to S-950; G-748 to S-950; S-749 to S-950; F-750 to S-950; 
L-751 to S-950; A-752 to S-950; 1-753 to S-950; K-754 to S-950; A-755 to S- 
950; A-756 to S-950; D-757 to S-950; G-758 to S-950; T-759 to S-950; Y-760 
to S-950; 1-761 to S-950; L-762 to S-950; N-763 to S-950; G-764 to S-950; D- 
765 to S-950; Y-766 to S-950; T-767 to S-950; L-768 to S-950; S-769 to S-950; 
T-770 to S-950; L-771 to S-950; E-772 to S-950; Q-773 to S-950; D-774 to S- 
30 950; 1-775 to S-950; M-776 to S-950; Y-777 to S-950; K-778 to S-950; G-779 
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to S-950" V-780 to S-950; V-781 to S-950; L-782 to S-950; R-783 to S-950; Y- 
784 to S-950; S-785 to S-950; G-786to S-950; S-787 to S-950; S-788 to S-950; 
A-789 to S-950; A-790 to S-950; L-791 to S-950; E-792 to S-950; R-793 to S- 
950 1-794 to S-950; R-795 to S-950; S-796 to S-950; F-797 to S-950; S-798 to 
5 S-950 P-799 to S-950; L-800 to S-950; K-801 to S-950; E-802 to S-950; P-803 

to S-950; L-804 to S-950; T-805 to S-950; 1-806 to S-950; Q-807 to S-950; V- 
808 to S-950; L-809 to S-950; T-810 to S-950; V-81 1 to S-950; G-81 2 to S-950; 
N-813 to S-950; A-814 to S-950; L-815 to S-950; R-816 to S-950; P-817 to S- 
950; K-818 to S-950; 1-819 to S-950; K-820 to S-950; Y-821 to S-950; T-822 to 
10 S-950; Y-823 to S-950; F-824 to S-950; V-825 to S-950; K-826 to S-950; K-827 

to S-950; K-828 to S-950; K-829 to S-950; E-830 to S-950; S-83 1 to S-950; F- 
832 to S-950; N-833 to S-950; A-834 to S-950; 1-835 to S-950; P-836 to S-950; 
T-837 to S-950; F-838 to S-950; S-839 to S-950; A-840 to S-950; W-841 to S- 
950 V-842 to S-950; 1-843 to S-950; E-844 to S-950; E-845 to S-950; W-846 
15 to S-950; G-847 to S-950; E-848 to S-950; C-849 to S-950; S-850 to S-950; K- 

851 to S-950; S-852 to S-950; C-853 to S-950; E-854 to S-950; L-855 to S-950; 
G-856 to S-950; W-857 to S-950; Q-858 to S-950; R-859 to S-950; R-860 to S- 
950; L-861 to S-950; V-862 to S-950; E-863 to S-950; C-864 to S-950; R-865 
to S-950; D-866 to S-950; 1-867 to S-950; N-868 to S-950; G-869 to S-950; Q- 
20 870 to S-950; P-871 to S-950; A-872 to S-950; S-873 to S-950; E-874 to S-950; 

C-875 to S-950; A-876 to S-950; K-877 to S-950; E-878 to S-950; V-879 to S- 
950; K-880 to S-950; P-881 to S-950; A-882 to S-950; S-883 to S-950; T-884 
to S-950; R-885 to S-950; P-886 to S-950; C-887 to S-950; A-888 to S-950; D- 
889 to S-950; H-890 to S-950; P-891 to S-950; C-892 to S-950; P-893 to S-950; 
25 Q-894 to S-950; W-895 to S-950; Q-896 to S-950; 1^897 to S-950; G-898 to S- 

950; E-899 to S-950; W-900 to S-950; S-901 to S-950; S-902 to S-950; C-903 
to S-950; S-904 to S-950; K-905 to S-950; T-906 to S-950; C-907 to S-950; G- 
908 to S-950;K-909 to S-950; G-910to S-950; Y-91 1 to S-950;K-912 to S-950; 
K-913 to S-950; R-914 to S-950; S-915 to S-950; L-916 to S-950; K-917 to S- 
30 950; C-918 to S-950; L-919 to S-950; S-920 to S-950; H-921 to S-950; D-922 
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to S-950; G-923 to S-950; G-924 to S-950; V-925 to S-950; L-926 to S-950; S- 
927 to S-950; H-928 to S-950; E-929 to S-950; S-930 to S-950; C-93 1 to S-950; 
D-932 to S-950; P-933 to S-950; L-934 to S-950; K-935 to S-950; K-936 to S- 
950; P-937 to S-950; K-938 to S-950; H-939 to S-950; F-940 to S-950; 1-941 to 
5 S-950; D-942 to S-950; F-943 to S-950; C-944 to S-950; T-945 to S-950; of SEQ 

IDNO:2. 

Moreover, C-terminal deletions of the METH1 polypeptide can also be 
described by the general formula 1-n, where n is an integer from 2 to 950, where 
n corresponds to the position of amino acid residue identified in SEQ ID NO:2. 
10 Preferably, C-terminal deletions of the METH1 polypeptide of the invention 

shown as SEQ ID NO:2 include polypeptides comprising the amino acid sequence 
of residues: M-l to C-949; M-l to E-948; M-l to A-947; M-l to M-946; M-l to 
T-945; M-l to C-944; M-l to F-943; M-l to D-942; M-l to 1-941; M-l to F-940; 
M-l to H-939; M-l to K-938; M-l to P-937; M-l to K-936; M-l to K-935; M-l 
15 to L-934; M-l to P-933; M-l to D-932; M-l to C-93 1 ; M-l to S-930; M-l to E- 

929; M-l to H-928; M-l to S-927; M-l to L-926; M-l to V-925; M-l to G-924; 
M-l to G-923; M-l to D-922; M-l to H-921; M-l to S-920; M-l to L-919; M-l 
to C-918; M-l to K-917; M-l to L-916; M-l to S-915; M-l to R-914; M-l to K- 
913; M-l to K-912; M-l to Y-91 1; M-l to G-910; M-l to K-909; M-l to G-908; 
20 M-l to C-907; M-l to T-906; M-l to K-905; M-l to S-904; M-l to C-903; M-l 

toS-902;M-l toS-901;M-l toW-900;M-l toE-899;M-l toG-898;M-l toL- 
897; M-l toQ-896;M-l toW-895;M-l toQ-894;M-l toP-893;M-l toC-892; 
M-l to P-891; M-l to H-890; M-l to D-889; M-l to A-888; M-l to C-887; M-l 
to P-886; M-l to R-885; M-l to T-884; M-l to S-883; M-l to A-882; M-l to P- 
25 881; M-l to K-880; M-l to V-879; M-l to E-878; M-l to K-877; M-l to A-876; 

M-l to C-875; M-l to E-874; M-l to S-873; M-l to A-872; M-l to P-871; M-l 
to Q-870; M-l to G-869; M-l to N-868; M-l to 1-867; M-l to D-866; M-l to R- 
865; M-l to C-864; M-l to E-863; M-l to V-862; M-l to L-861; M-l toR-860; 
M-l toR-859;M-l toQ-858;M-l toW-857;M-l toG-856;M-l toL-855;M-l 
30 to E-854; M-l to C-853; M-l to S-852; M-l to K-85 1 ; M-l to S-850; M-l to C- 
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849; M-l to E-848; M-l to G-847; M-l to W-846; M-l to E-845; M-l to E-844; 
M-l to 1-843; M-l to V-842; M-l to W-841; M-l to A-840; M-l to S-839; M-l 
to F-838; M-l to T-837; M-l to P-836; M-l to 1-835; M-l to A-834; M-l to N- 
833; M-l to F-832; M-l to S-831; M-l to E-830; M-l to K-829; M-l to K-828; 
5 M-l to K-827; M-l to K-826; M-l to V-825; M-l to F-824; M-l to Y-823; M-l 

toT-822;M-l to Y-821;M-1 toK-820;M-l to 1-819; M-l toK-818;M-l toP- 
817; M-l to R-816; M-l to 1^815; M-l to A-814; M-l to N-813; M-l to G-812; 
M-l to V-81 1; M-l to T-810; M-l to L-809; M-l to V-808; M-l to Q-807; M-l 
to 1-806; M-l to T-805; M-l to L-804; M-l to P-803; M-l to E-802; M-l to K- 
10 801; M-l to L-800; M-l to P-799; M-l to S-798; M-l to F-797; M-l to S-796; 

M-l to R-795; M-l to 1-794; M-l to R-793; M-l to E-792; M-l to L-791 ; M-l 
to A-790; M-l to A-789; M-l to S-788; M-l to S-787; M-l to G-786; M-l to S- 
785; M-l toY-784;M-l toR-783;M-l toL-782;M-l to V-781;M-1 to V-780; 
M-l to G-779; M-l to K-778; M-l to Y-777; M-l to M-776; M-l to 1-775; M-l 
15 to D-774; M-l to Q-773; M-l to E-772; M-l to L-771; M-l to T-770; M-l to S- 

769; M-l to L-768; M-l to T-767; M-l to Y-766; M-l to D-765; M-l to G-764; 
M-l to N-763; M-l to L-762; M-l to 1-761; M-l to Y-760; M-l to T-759; M-l 
to G-758; M l to D-757; M-l to A-756; M-l to A-755; M-l to K-754; M-l to 
1-753; M-l to A-752; M-l to L-75 1 ; M-l to F-750; M-l to S-749; M-l to G-748; 
20 M-l to N-747; M-l to N-746; M-l to R-745; M-l to S-744; M-l to G-743; M-l 

to R-742; M-l to Q-741; M-l to N-740; M-l to R-739; M-l to Q-738; M-l to 
K-737; M-l to V-736; M-l to E-735; M-l to 1-734; M-l to N-733; M-l to T- 
732; M-l to A-731; M-l to G-730; M-l to T-729; M-l to P-728; M-l to 1-727; 
M-l to T-726; M-l to 1-725; M-l to 1-724; M-l to D-723; M-l to H-722; M-l 
25 to Y-721 ; M-l to G-720; M-l to P-719; M-l to K-718; M-l to A-717; M-l to S- 

716; M-l to T-715; M-l to V-714; M-l to S-713; M-l to G-712; M-l to S-71 1; 
M-l to 1-710; M-l to K-709; M-l to K-708; M-l to C-707; M-l to T-706; M-l 
to S-705; M-l to G-704; M-l toN-703; M-l to G-702; M-l to G-701; M-l to C- 
700; M-l to V-699; M-l to G-698; M-l to C-697; M-l to K-696; M-l to D-695; 
30 M-l toF-694;M-l toK-693;M-l toK-692;M-l toK-691;M-l toS-690;M-l 
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to D-689; M-l to 1-688; M-l to 1-687; M-l to R-686; M-l to D-685; M-l to C- 
684;M-1 toG-683;M-l toA-682;M-l toK-681;M-l to V-680; M-l toC-679; 
M-l to Q-678; M-l to G-677; M-l to Q-676; M-l to V-675; M-l to C-674; M-l 
to V-673; M-l to S-672; M-l to T-671; M-l to S-670; M-l to D-669; M-l to P- 
668; M-l to S-667; M-l to C-666; M-l to P-665; M-l to T-664; M-l to G-663; 
M-l to D-662; M-l to V-661; M-l to V-660; M-l to K-659; M-l to P-658; M-l 
to Q-657; M-l to L-656; M-l to V-655; M-l to F-654; M-l to F-653; M-l to Y- 
652; M-l to G-651; M-l to 1-650; M-l to G-649; M-l to K-648; M-l to A-647; 
M-l to Q-646; M-l to C-645; M-l to 1-644; M-l to L-643; M-l to K-642; M-l 
to C-641; M-l to R-640; M-l to D-639; M-l to K-638; M-l to P-637; M-l to S- 
636; M-l to V-635; M-l to G-634; M-l to A-633; M-l to Y-632; M-l to K-63 1 ; 
M-l to P-630; M-l to 1-629; M-l to W-628; M-l to E-627; M-l to V-626; M-l 
to A-625; M-l to P-624; M-l to G-623; M-l to S-622; M-l to G-621; M-l to F- 
620; M-l to S-619; M-l to A-618; M-l to K-617; M-l to S-616; M-l to F-615; 
M-l to E-614; M-l to N-613; M-l to H-612; M-l to A-61 1 ; M-l to E-610; M-l 
to C-609; M-l to Q-608; M-l to E-607; M-l to E-606; M-l to R-605; M-l to F- 
604; M-l to T-603; M-l to K-602; M-l to G-601 ; M-l to N-600; M-l to N-599; 
M-l to D-598; M-l to P-597; M-l to C-596; M-l to D-595; M-l to E-594; M-l 
toL-593; M-l to N-592; M-l to C-591; M-l to S-590; M-l to R-589; M-l to Y- 
588; M-l to R-587; M-l to V-586; M-l to R-585; M-l to K-584; M-l to G-583; 
M-l to E-582; M-l to C-581; M-l to Y-580; M-l to K-579; M-l to G-578; M-l 
to G-577; M-l to N-576; M-l to K-575; M-l to P-574; M-l to V-573; M-l to P- 
572; M-l to N-571; M-l to D-570; M-l to C-569; M-l to E-568; M-l to R-567; 
M-l to M-566; M-l to T-565; M-l to Y-564; M-l to Q-563; M-l to V-562; M-l 
to G-561 ; M-l to G-560; M-l to G-559; M-l to C-558; M-l to T-557; M-l to R- 
556; M-l to S-555; M-l toC-554;M-l to D-553; M-l to G-552; M-l to W-551; 
M-l to P-550; M-l to G-549; M-l to W-548; M-l to M-547; M-l to G-546; M-l 
to W-545; M-l to S-544; M-l to G-543; M-l to H-542; M-l to F-541; M-l to 
P-540; M-l to T-539; M-l to D-538; M-l to F-537; M-l to H-536; M-l to K- 
535;M-1 toR-534;M-l toD-533;M-l toT-532;M-l toK-531;M-l toN-530; 
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M-l to V-529; M-l to C-528; M-l to K-527; M-l to G-526; M-l to N-525; M-l 
to 1-524; M-l to C-523; M-l to W-522; M-l to K-521; M-l to G-520; M-l to E- 
519; M-l toG-518;M-l toC-517;M-l toS-516;M-l toT-515; M-l toG-514; 
M-l to D-513; M-l to A-512; M-l to W-51 1; M-l to P-510; M-l to F-509; M-l 
to H-508; M-l to K-507; M-l to T-506; M-l to Q-505; M-l to C-504; M-l to V- 
503; M-l to L-502; M-l to V-501; M-l to G-500; M-l to G-499; M-l to S-498; 
M-l to T-497; M-l to G-496; M-l to T-495; M-l to C-494; M-l to W-493; M-l 
to L-492; M-l to T-491 ; M-l to S-490; M-l to C-489; M-l to T-488; M-l to S- 
487; M-l to A-486; M-l to A-485; M-l to D-484; M-l to P-483; M-l to C-482; 
M-l to H-481; M-l to K-480; M-l to S-479; M-l to D-478; M-l to E-477; M-l 
to G-476; M-l to F-475; M-l to T-474; M-l to F-473; M-l to Q-472; M-l to C- 
471; M-l to Q-470; M-l to R-469; M-l to N-468; M-l to A-467; M-l to D-466; 
M-l to Y-465; M-l to S-464; M-l to T-463; M-l to G-462; M-l to P-461; M-l 
to L-460; M-l to D-459; M-l to G-458; M-l to P-457; M-l to L-456; M-l to Q- 
455; M-l to 1-454; M-l to P-453; M-l to N-452; M-l to Q-451; M-l to P-450; 
M-l to K-449; M-l to D-448; M-l to M-447; M-l to L-446; M-l to C-445; M-l 
to E-444; M-l to G-443; M-l to H-442; M-l to G-441; M-l to N-440; M-l to 
D-439; M-l to L-438; M-l to F-437; M-l to S-436; M-l to T-435; M-l to 1-434; 
M-l toM-433;M-l to Y-432; M-l toA-431;M-l toS-430;M-l toC-429; M-l 
to P-428; M-l to S-427; M-l to W-426; M-l to P-425; M-l to Q-424; M-l to S- 
423; M-l to H-422; M-l to D-421; M-l to L-420; M-l to N-419; M-l to S-418; 
M-l to L-417; M-l to M-416; M-l to S-415; M-l to A-414; M-l to M-413; M-l 
to M-412; M-l to H-41 1; M-l to S-410; M-l to D-409; M-l to Q-408; M-l to 
N-407; M-l to V-406; M-l to G-405; M-l to N-404; M-l to L-403; M-l to S- 
402; M-l to A-401; M-l to C-400; M-l to Q-399; M-l to K-398; M-l to A-397; 
M-l to D-396; M-l to D-395; M-l to H-394; M-l to P-393; M-l to M-392; M-l 
to N-391; M-l to F-390; M-l to V-389; M-l to H-388; M-l to G-387; M-l to L- 
386; M-l to E-385; M-l to H-384; M-l to A-383; M-l to T-382; M-l to T-381; 
M-l to F-380; M-l to A-379; M-l to A-378; M-l to Q-377; M-l to L-376; M-l 
to G-375; M-l to D-374; M-l to D-373; M-l to E-372; M-l to 1-371; M-l to V- 
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370; M-l to S-369; M-l to C-368; M-l to S-367; M-l to R-366; M-l to S-365; 
M-l to P-364; M-l to D-363; M-l to C-362; M-l to V-361 ; M-l to T-360; M-l 
to G-359; M-l to V-358; M-l to D-357; M-l to A-356; M-l to M-355; M-l to 
G-354; M-l to L-353; M-l to T-352; M-l to D-351; M-l to C-350; M-l to T- 

5 349; M-l to Q-348; M-l to S-347; M-l to G-346; M-l to C-345; M-l to L-344; 

M-l to D-343; M-l to Q-342; M-l to R-341; M-l to T-340; M-l to F-339; M-l 
to L-338; M-l to 1-337; M-l to A-336; M-l to T-335; M-l to D-334; M-l to Y- 
333; M-l to H-332; M-l to E-331; M-l to A-330; M-l to D-329; M-l to R-328; 
M-l to D-327; M-l to S-326; M-l to P-325; M-l to P-324; M-l to N-323; M-l 

10 to H-322; M-l to Q-321; M-l to K-320; M-l to Q-319; M-l to W-318; M-l to 

N-317; M-l to C-316; M-l to F-315; M-l to N-314; M-l to R-313; M-l to L- 
312; M-l toT-311;M-l toL-310;M-l to A-309;M-1 to A-308;M-1 toN-307; 
M-l to S-306; M-l to T-305; M-l to V-304; M-l to E-303; M-l to P-302; M-l 
to G-301; M-l to K-300; M-l to Q-299; M-l to E-298; M-l to D-297; M-l to 

15 H-296;M-1 toI-295;M-l toV-294;M-l toL-293;M-l to 1-292; M-l toK-291; 

M-l to V-290; M-l to V-289; M-l to V-288; M-l to L-287; M-l to S-286; M-l 
to V-285; M-l to S-284; M-l to N-283; M-l to R-282; M-l to 1-281; M-l to S- 
280; M-l to P-279; M-l to H-278; M-l to K-277; M-l to Y-276; M-l to L-275; 
M-l to R-274; M-l to A-273; M-l to A-272; M-l to V-271 ; M-l to S-270; M-l 

20 to F-269; M-l to L-268; M-l to T-267; M-l to L-266; M-l to L-265; M-l to Y- 

264; M-l to H-263; M-l to K-262; M-l to L-261; M-l to G-260; M-l to S-259; 
M-l to G-258; M-l to H-257; M-l to F-256; M-l to E-255; M-l to A-254; M-l 
to M-253; M-l to S-252; M-l to Q-251; M-l to D-250; M-l to A-249; M-l to 
V-248; M-l to L-247; M-l to M-246; M-l to T-245; M-l to E-244; M-l to V- 

25 243; M-l to Y-242; M-l to R-241; M-l to H-240; M-l to S-239; M-l to S-238; 

M-l to V-237; M-l to F-236; M-l to R-235; M-l to K-234; M-l to K-233; M-l 
to R-232; M-l to 1-231; M-l to S-230; M-l to G-229; M-l to T-228; M-l to G- 
227; M-l to T-226; M-l to P-225; M-l to Q-224; M-l to G-223; M-l to V-222; 
M-l to G-221 ; M-l to Q-220; M-l to L-219; M-l to A-218; M-l to P-217; M-l 

30 to D-216; M-l to Q-215; M-l to P-214; M-l to S-213; M-l to W-212; M-l to 
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Q-21 1; M-l to P-210; M-l to G-209; M-l to E-208; M-l to D-207; M-l to E- 
206; M-l to G-205; M-l to E-204; M-l to T-203; M-l to G-202; M-l to E-201 ; 
M-l toD-200;M-l toE-199;M-l toD-198;M-l toE-197; M-l toT-196;M-l 
toE-195;M-l toA-194;M-l toK-193;M-l toG-192;M-l toT-l91;M-l toP- 
5 190; M-l toR-189;M-l toP-188;M-l toE-187;M-l toD-186;M-l toD-185; 

M-l to V-184; M-l to V-183; M-l to G-182; M-l toC-181; M-l toT-180; M-l 
to G-179; M-l to G-178; M-l to V-177; M-l to D-176; M-l to G-175; M-l to 
Q-174; M-l to R-173; M-l to N-172; M-l to R-171; M-l to R-170; M-l to L- 
169; M-l to L-168; M-l to H-167; M-l to F-166; M-l to Q-165; M-l to L-164; 

10 M-l to P-163; M-l to A-162; M-l to P-161; M-l to P-160; M-l to K-159; M-l 

toE-158;M-l toG-157;M-l toP-156;M-l toA-155;M-l to A-154;M-1 toT- 
153; M-l to A-152; M-l to L-151; M-l to R-150; M-l to E-149; M-l to S-148; 
M-l to A-147; M-l to A-146; M-l to P-145; M-l to L-144; M-l to P-143; M-l 
to Q-142; M-l to 1-141; M-l to F-140; M-l to Y-139; M-l to A-138; M-l to E- 

15 137; M-l to G-136; M-l to L-135; M-l to L-134; M-l to Y-133; M-l to F-132; 

M-l toA-131;M-l toG-130;M-l toR-129;M-l to V-128;M-1 toG-127;M-l 
to E-126; M-l to C-125; M-l to L-124; M-l to S-123; M-l to L-122; M-l to A- 
121; M-l to A-120; M-l to A-l 19; M-l to S-l 18; M-l to S-l 17; M-l to P-l 16; 
M-l to D-l 15; M-l to G-l 14; M-l to N-l 13; M-l to V-l 12; M-l to T-l 1 1; M-l 

20 to G-l 10; M-l to S-109; M-l to Y-108; M-l to F-107; M-l to C-106; M-l to H- 

105; M-l to A-104; M-l to L-103; M-l to D-102; M-l to T-101 ; M-l to E-100; 
M-l to P-99; M-l to L-98; M-l to P-97; M-l to T-96; M-l to E-95; M-l to S-94; 
M-l to G-93; M-l to S-92; M-l to K-91 ; M-l to R-90; M-l to G-89; M-l to V- 
88; M-l to N-87; M-l to Q-86; M-l to L-85; M-l to T-84; M-l to F-83; M-l to 

25 G-82; M-l to P-81; M-l to A-80; M-l to L-79; M-l to F-78; M-l to S-77; M-l 

toS-76;M-l toD-75;M-l toP-74;M-l toR-73;M-l toL-72;M-l toE-71;M- 
1 to L-70; M-l to D-69; M-l to L-68; M-l to Q-67; M-l to Q-66; M-l to D-65; 
M-l to F-64; M-l to A-63; M-l to H-62; M-l to L-61 ; M-l to R-60; M-l to L- 
59; M-l to R-58; M-l to T-57; M-l to T-56; M-l to G-55; M-l to H-54; M-l to 

30 G-53;M-ltoP-52;M-ltoA-51;M-ltoR-50;M-ltoE-49;M-ltoL-48;M-l 
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to E-47; M-l to P-46; M-l to V-45; M-l to V-44; M-l to L-43; M-l to E-42; M- 
1 to E-41; M-l to D-40; M-l to E-39; M-l to E-38; M-l to S-37; M-l to P-36; 
M-l to R-35; M-l to G-34; M-l to L-33; M-l to A-32; M-l to D-31; M-l to S- 
30; M-l to V-29; M-l to A-28; M-l to L-27; M-l to L-26; M-l to A-25; M-l to 
5 A-24; M-l to A-23; M-l to L-22; M-l to L-21; M-l to L-20; M-l to L-19; M-l 

to T-18; M-l to P-17; M-l to V-16; M-l to P-15; M-l to G-14; M-l to F-13; M- 
1 to S-12; M-l to R-l 1; M-l to S-10; M-l to G-9; M-l to P-8; M-l to A-7; of 
SEQ ID NO:2. For example, any of the above listed N- or C-terminal deletions 
can be combined to produce a N- and C-terminal deleted METH1 polypeptide. 

10 Moreover, N-terminal deletions of the METH2 polypeptide can be 

described by the general formula m-890, where m is an integer from 2 to 889, 
where m corresponds to the position of the amino acid residue identified in SEQ 
ID NO:4. Preferably, N-terminal deletions of the METH2 polypeptide of the 
invention shown as SEQ ID NO:4 include polypeptides comprising the amino acid 

15 sequence of residues: F-2 to L-890; P-3 to L-890; A-4 to L-890; P-5 to L-890; 

A-6 to L-890; A-7 to L-890; P-8 to L-890; R-9 to L-890; W-10 to L-890; L-l 1 
to L-890; P-12 to L-890; F-13 to L-890; L-14 to L-890; L-l 5 to L-890; L-l 6 to 
L-890; L-17 to L-890; L-18 to L-890; L-19 to L-890; L-20 to L-890; L-21 to L- 
890; L-22 to L-890; P-23 to L-890; L-24 to L-890; A-25 to L-890; R-26 to L- 

20 890; G-27 to L-890; A-28 to L-890; P-29 to L-890; A-30 to L-890; R-3 1 to L- 

890; P-32 to L-890; A-33 to L-890; A-34 to L-890; G-35 to L-890; G-36 to L- 
890; Q-37 to L-890; A-38 to L-890; S-39 to L-890; E-40 to L-890; L-41 to L- 
890; V-42 to L-890; V-43 to L-890; P-44 to L-890; T-45 to L-890; R-46 to L- 
890; L-47 to L-890; P-48 to L-890; G-49 to L-890; S-50 to L-890; A-51 to L- 

25 890; G-52 to L-890; E-53 to L-890; L-54 to L-890; A-55 to L-890; L-56 to L- 

890; H-57 to L-890; L-58 to L-890; S-59 to L-890; A-60 to L-890; F-61 to L- 
890; G-62 to L-890; K-63 to L-890; G-64 to L-890; F-65 to L-890; V-66 to L- 
890; L-67 to L-890; R-68 to L-890; L-69 to L-890; A-70 to L-890; P-71 to L- 
890; D-72 to L-890; D-73 to L-890; S-74 to L-890; F-75 to L-890; L-76 to L- 

30 890; A-77 to L-890; P-78 to L-890; E-79 to L-890; F-80 to L-890; K-81 to L- 
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890; 1-82 to L-890; E-83 to L-890; R-84 to L-890; L-85 to L-890; G-86 to L- 
890; G-87 to L-890; S-88 to L-890; G-89 to L-890; R-90 to L-890; A-91 to L- 
890; T-92 to L-890; G-93 to L-890; G-94 to L-890; E-95 to L-890; R-96 to L- 
890; G-97 to L-890; L-98 to L-890; R-99 to L-890; G- 1 00 to L-890; C- 1 0 1 to L- 
890; F-102 to L-890; F-103 to L-890; S-104 to L-890; G-105 to L-890; T-106 
to L-890; V-107 to L-890; N-108 to L-890; G-109 to L-890; E-l 10 to L-890; P- 
1 1 1 to L-890; E-l 12 to L-890; S-l 13 to L-890; L-l 14 to L-890; A-l 15 to L-890; 
A-l 16 to L-890; V-l 17 to L-890; S-l 18 to L-890; L-l 19 to L-890; C-120 to L- 
890; R-121 to L-890; G-122 to L-890; L-123 to L-890; S-124 to L-890; G-125 
to L-890; S-126 to L-890; F-127 to L-890; L-128 to L-890; L-129 to L-890; D- 
130 to L-890; G-13 1 to L-890; E-132 to L-890; E-133 to L-890; F-134 to L-890; 
T-135 to L-890; 1-136 to L-890; Q-137 to L-890; P-138 to L-890; Q-139 to L- 
890; G-140 to L-890; A-141 to L-890; G-142 to L-890; G-143 to L-890; S-144 
to L-890; L-145 to L-890; A-146 to L-890; Q-147 to L-890; P-148 to L-890; H- 
149 to L-890; R-l 50 to L-890; L-l 51 to L-890; Q- 152 to L-890; R- 153 to L-890; 
W-154 to L-890; G-155 to L-890; P-156 to L-890; A-l 57 to L-890; G-158 to L- 
890; A-159 to L-890; R-160 to L-890; P-161 to L-890; L-162 to L-890; P-163 
to L-890; R-l 64 to L-890; G-l 65 to L-890; P-l 66 to L-890; E-167 to L-890; W- 
168 to L-890; E-169 to L-890; V-170 to L-890; E-171 to L-890; T-172 to L-890; 
G-173 to L-890; E-174 to L-890; G-175 to L-890; Q-176 to L-890; R-177 to L- 
890; Q-178 to L-890; E-179 to L-890; R-180 to L-890; G-181 to L-890; D-182 
to L-890; H-l 83 to L-890; Q-l 84 to L-890; E-l 85 to L-890; D-l 86 to L-890; S- 
187 to L-890; E-188 to L-890; E-189 to L-890; E-190 to L-890; S-191 to L-890; 
Q-l 92 to L-890; E-l 93 to L-890; E-l 94 to L-890; E-l 95 to L-890; A- 196 to L- 
890; E-l 97 to L-890; G-l 98 to L-890; A- 199 to L-890; S-200 to L-890; E-201 
to L-890; P-202 to L-890; P-203 to L-890; P-204 to L-890; P-205 to L-890; L- 
206 to L-890; G-207 to L-890; A-208 to L-890; T-209 to L-890; S-210 to L-890; 
R-21 1 to L-890; T-212 to L-890; K-213 to L-890; R-214 to L-890; F-21 5 to L- 
890; V-216 to L-890; S-217 to L-890; E-218 to L-890; A-219 to L-890; R-220 
to L-890; F-221 to L-890; V-222 to L-890; E-223 to L-890; T-224 to L-890; L- 
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225 to L-890; L-226 to L-890; V-227 to L-890; A-228 to L-890; D-229 to L-890; 
A-230 to L-890; S-23 1 to L-890; M-232 to L-890; A-233 to L-890; A-234 to L- 
890; F-235 to L-890; Y-236 to L-890; G-237 to L-890; A-238 to L-890; D-239 
to L-890; L-240 to L-890; Q-241 to L-890; N-242 to L-890; H-243 to L-890; I- 
5 244 to L-890; L-245 to L-890; T-246 to L-890; L-247 to L-890; M-248 to L-890; 

S-249 to L-890; V-250 to L-890; A-25 1 to L-890; A-252 to L-890; R-253 to L- 
890; 1-254 to L-890; Y-255 to L-890; K-256 to L-890; H-257 to L-890; P-258 
to L-890; S-259 to L-890; 1-260 to L-890; K-261 to L-890; N-262 to L-890; S- 
263 to L-890; 1-264 to L-890; N-265 to L-890; L-266 to L-890; M-267 to L-890; 

10 V-268 to L-890; V-269 to L-890; K-270 to L-890; V-271 to L-890; L-272 to L- 

890; 1-273 to L-890; V-274 to L-890; E-275 to L-890; D-276 to L-890; E-277 
to L-890; K-278 to L-890; W-279 to L-890; G-280 to L-890; P-28 1 to L-890; E- 
282 to L-890; V-283 to L-890; S-284 to L-890; D-285 to L-890; N-286 to L-890; 
G-287 to L-890; G-288 to L-890; L-289 to L-890; T-290 to L-890; L-291 to L- 

15 890; R-292 to L-890; N-293 to L-890; F-294 to L-890; C-295 to L-890; N-296 

to L-890; W-297 to L-890; Q-298 to L-890; R-299 to L-890; R-300 to L-890; F- 
301 to L-890; N-302 to L-890; Q-303 to L-890; P-304 to L-890; S-305 to L-890; 
D-306 to L-890; R-307 to L-890; H-308 to L-890; P-309 to L-890; E-3 10 to L- 
890; H-3 1 1 to L-890; Y-3 1 2 to L-890; D-3 1 3 to L-890; T-3 1 4 to L-890; A-3 1 5 

20 to L-890; 1-3 1 6 to L-890; L-3 1 7 to L-890; L-3 1 8 to L-890; T-3 1 9 to L-890; R- 

320 to L-890; Q-32 1 to L-890; N-322 to L-890; F-323 to L-890; C-324 to L-890; 
G-325 to L-890; Q-326 to L-890; E-327 to L-890; G-328 to L-890; L-329 to L- 
890; C-330 to L-890; D-331 to L-890; T-332 to L-890; L-333 to L-890; G-334 
to L-890; V-335 to L-890; A-336 to L-890; D-337 to L-890; 1-338 to L-890; G- 

25 339 to L-890; T-340 to L-890; 1-341 to L-890; C-342 to L-890; D-343 to L-890; 

P-344 to L-890; N-345 to L-890; K-346 to L-890; S-347 to L-890; C-348 to L- 
890; S-349 to L-890; V-350 to L-890; 1-351 to L-890; E-352 to L-890; D-353 
to L-890; E-354 to L-890; G-355 to L-890; L-3 56 to L-890; Q-357 to L-890; A- 
358 to L-890; A-359 to L-890; H-360 to L-890; T-36 1 to L-890; L-362 to L-890; 

30 A-363 to L-890; H-364 to L-890; E-365 to L-890; L-366 to L-890; G-367 to L- 
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890; H-368 to L-890; V-369 to L-890; L-370 to L-890; S-371 to L-890; M-372 
to L-890; P-373 to L-890; H-374 to L-890; D-375 to L-890; D-376 to L-890; S- 
377 to L-890; K-378 to L-890; P-379 to L-890; C-380 to L-890; T-381 to L-890; 
R-382 to L-890; L-383 to L-890; F-384 to L-890; G-385 to L-890; P-386 to L- 
890; M-387 to L-890; G-388 to L-890; K-389 to L-890; H-390 to L-890; H-391 
to L-890; V-392 to L-890; M-393 to L-890; A-394 to L-890; P-395 to L-890; L- 
396 to L-890; F-397 to L-890; V-398 to L-890; H-399 to L-890; L-400 to L-890; 
N-401 to L-890; Q-402 to L-890; T-403 to L-890; L-404 to L-890; P-405 to L- 
890; W-406 to L-890; S-407 to L-890; P-408 to L-890; C-409 to L-890; S-410 
to L-890; A-41 1 to L-890; M-412 to L-890; Y-413 to L-890; L-414 to L-890; T- 
41 5 to L-890; E-416 to L-890; L-417 to L-890; L-41 8 to L-890; D-419 to L-890; 
G-420 to L-890; G-421 to L-890; H-422 to L-890; G-423 to L-890; D-424 to L- 
890; C-425 to L-890; L-426 to L-890; L-427 to L-890; D-428 to L-890; A-429 
to L-890; P-430 to L-890; G-43 1 to L-890; A-432 to L-890; A-433 to L-890; L- 
434 to L-890; P-435 to L-890; L-436 to L-890; P-437 to L-890; T-438 to L-890; 
G-439 to L-890; L-440 to L-890; P-441 to L-890; G-442 to L-890; R-443 to L- 
890; M-444 to L-890; A-445 to L-890; L-446 to L-890; Y-447 to L-890; Q-448 
to L-890; L-449 to L-890; D-450 to L-890; Q-451 to L-890; Q-452 to L-890; C- 
453 to L-890; R-454 to L-890; Q-455 to L-890; 1-456 to L-890; F-457 to L-890; 
G-458 to L-890; P-459 to L-890; D^60 to L-890; F-461 to L-890; R-462 to L- 
890; H-463 to L-890; C-464 to L-890; P-465 to L-890; N-466 to L-890; T-467 
to L-890; S-468 to L-890; A-469 to L-890; Q-470 to L-890; D-471 to L-890; V- 
472 to L-890; C-473 to L-890; A-474 to L-890; Q-475 to L-890; L-476 to L-890; 
W-477 to L-890; C-478 to L-890; H-479 to L-890; T-480 to L-890; D-48 1 to L- 
890; G-482 to L-890; A-483 to L-890; E-484 to L-890; P-485 to L-890; L-486 
to L-890; C-487 to L-890; H-488 to L-890; T-489 to L-890; K-490 to L-890; N- 
491 to L-890; G-492 to L-890; S-493 to L-890; L-494 to L-890; P-495 to L-890; 
W-496 to L-890; A-497 to L-890; D-498 to L-890; G-499 to L-890; T-500 to L- 
890; P-501 to L-890; C-502 to L-890; G-503 to L-890; P-504 to L-890; G-505 
to L-890; H-506 to L-890; L-507 to L-890; C-508 to L-890; S-509 to L-890; E- 
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5 1 0 to L-890; G-5 1 1 to L-890; S-5 1 2 to L-890; C-5 1 3 to L-890; L-5 1 4 to L-890; 
P-5 1 5 to L-890; E-5 1 6 to L-890; E-5 1 7 to L-890; E-5 1 8 to L-890; V-5 1 9 to L- 
890; E-520 to L-890; R-521 to L-890; P-522 to L-890; K-523 to L-890; P-524 
to L-890; V-525 to L-890; V-526 to L-890; D-527 to L-890; G-528 to L-890; G- 
529 to L-890; W-530 to L-890; A-53 1 to L-890; P-532 to L-890; W-533 to L- 
890; G-534 to L-890; P-535 to L-890; W-536 to L-890; G-537 to L-890; E-538 
to L-890; C-539 to L-890; S-540 to L-890; R-541 to L-890; T-542 to L-890; C- 
543 to L-890; G-544 to L-890; G-545 to L-890; G-546 to L-890; V-547 to L- 
890; Q-548 to L-890; F-549 to L-890; S-550 to L-890; H-551 to L-890; R-552 
to L-890; E-553 to L-890; C-554 to L-890; K-555 to L-890; D-556 to L-890; P- 
557 to L-890; E-558 to L-890; P-559 to L-890; Q-560 to L-890; N-56 1 to L-890; 
G-562 to L-890; G-563 to L-890; R-564 to L-890; Y-565 to L-890; C-566 to L- 
890; L-567 to L-890; G-568 to L-890; R-569 to L-890; R-570 to L-890; A-571 
to L-890; K-572 to L-890; Y-573 to L-890; Q-574 to L-890; S-575 to L-890; C- 
576 to L-890; H-577 to L-890; T-578 to L-890; E-579 to L-890; E-580 to L-890; 
C-581 to L-890; P-582 to L-890; P-583 to L-890; D-584 to L-890; G-585 to L- 
890; K-586 to L-890; S-587 to L-890; F-588 to L-890; R-589 to L-890; E-590 
to L-890; Q-591 to L-890; Q-592 to L-890; C-593 to L-890; E-594 to L-890; K- 
595 to L-890; Y-596 to L-890; N-597 to L-890; A-598 to L-890; Y-599 to L- 
890; N-600 to L-890; Y-601 to L-890; T-602 to L-890; D-603 to L-890; M-604 
to L-890; D-605 to L-890; G-606 to L-890; N-607 to L-890; L-608 to L-890; L- 1 
609 to L-890; Q-610 to L-890; W-61 1 to L-890; V-612 to L-890; P-613 to L- 
890; K-614 to L-890; Y-615 to L-890; A-616 to L-890; G-617 to L-890; V-618 
to L-890; S-619 to L-890; P-620 to L-890; R-621 to L-890; D-622 to L-890; R- 
623 to L-890; C-624 to L-890; K-625 to L-890; L-626 to L-890; F-627 to L-890; 
C-628 to L-890; R-629 to L-890; A-630 to L-890; R-63 1 to L-890; G-632 to L- 
890; R-633 to L-890; S-634 to L-890; E-635 to L-890; F-636 to L-890; K-637 
to L-890; V-638 to L-890; F-639 to L-890; E-640 to L-890; A-641 to L-890; K- 
642 to L-890; V-643 to L-890; 1-644 to L-890; D-645 to L-890; G-646 to L-890; 
T-647 to L-890; L-648 to L-890; C-649 to L-890; G-650 to L-890; P-651 to L- 
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890; E-652 to L-890; T-653 to L-890; L-654 to L-890; A-655 to L-890; 1-656 to 
L-890; C-657 to L-890; V-658 to L-890; R-659 to L-890; G-660 to L-890; Q-66 1 
to L-890; C-662 to L-890; V-663 to L-890; K-664 to L-890; A-665 to L-890; G- 
666 to L-890; C-667 to L-890; D-668 to L-890; H-669 to L-890; V-670 to L- 
5 890; V-671 to L-890; D-672 to L-890; S-673 to L-890; P-674 to L-890; R-675 

to L-890; K-676 to L-890; L-677 to L-890; D-678 to L-890; K-679 to L-890; C- 
680 to L-890; G-681 to L-890; V-682 to L-890; C-683 to L-890; G-684 to L- 
890; G-685 to L-890; K-686 to L-890; G-687 to L-890; N-688 to L-890; S-689 
to L-890; C-690 to L-890; R-691 to L-890; K-692 to L-890; V-693 to L-890; S- 

10 694 to L-890; G-695 to L-890; S-696 to L-890; L-697 to L-890; T-698 to L-890; 

P-699 to L-890; T-700 to L-890; N-701 to L-890; Y-702 to L-890; G-703 to L- 
890; Y-704 to L-890; N-705 to L-890; D-706 to L-890; 1-707 to L-890; V-708 
to L-890; T-709 to L-890; 1-710 to L-890; P-71 1 to L-890; A-712 to L-890; G- 
713 to L-890; A-714 to L-890; T-71 5 to L-890; N-716 to L-890; 1-717 to L-890; 

15 D-7 1 8 to L-890; V-71 9 to L-890; K-720 to L-890; Q-72 1 to L-890; R-722 to L- 

890; S-723 to L-890; H-724 to L-890; P-725 to L-890; G-726 to L-890; V-727 
to L-890; Q-728 to L-890; N-729 to L-890; D-730 to L-890; G-73 1 to L-890; N- 
732 to L-890; Y-733 to L-890; L-734 to L-890; A-735 to L-890; L-736 to L-890; 
K-737 to L-890; T-738 to L-890; A-739 to L-890; D-740 to L-890; G-741 to L- 

20 890; Q-742 to L-890; Y-743 to L-890; L-744 to L-890; L-745 to L-890; N-746 

to L-890; G-747 to L-890; N-748 to L-890; L-749 to L-890; A-750 to L-890; I- 
751 to L-890; S-752 to L-890; A-753 to L-890; 1-754 to L-890; E-755 to L-890; 
Q-756 to L-890; D-757 to L-890; 1-758 to L-890; L-759 to L-890; V-760 to L- 
890; K-761 to L-890; G-762 to L-890; T-763 to L-890; 1-764 to L-890; L-765 

25 to L-890; K-766 to L-890; Y-767 to L-890; S-768 to L-890; G-769 to L-890; S- 

770 to L-890; 1-771 to L-890; A-772 to L-890; T-773 to L-890; L-774 to L-890; 
E-775 to L-890; R-776 to L-890; L-777 to L-890; Q-778 to L-890; S-779 to L- 
890; F-780 to L-890; R-78 1 to L-890; P-782 to L-890; L-783 to L-890; P-784 to 
L-890; E-785 to L-890; P-786 to L-890; L-787 to L-890; T-788 to L-890; V-789 

30 to L-890; Q-790 to L-890; L-791 to L-890; L-792 to L-890; T-793 to L-890; V- 
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794 to L-890; P-795 to L-890; G-796 to L-890; E-797 to L-890; V-798 to L-890; 
F-799 to L-890; P-800 to L-890; P-801 to L-890; K-802 to L-890; V-803 to L- 
890; K-804 to L-890; Y-805 to L-890; T-806 to L-890; F-807 to L-890; F-808 
to L-890; V-809 to L-890; P-8 1 0 to L-890; N-8 1 1 to L-890; D-8 1 2 to L-890; V- 
5 813 toL-890;D-814toL-890;F-815to L-890; S-816 to L-890; M-817 to L-890; 

Q-818 to L-890; S-819 to L-890; S-820 to L-890; K-821 to L-890; E-822 to L- 
890; R-823 to L-890; A-824 to L-890; T-825 to L-890; T-826 to L-890; N-827 
to L-890; 1-828 to L-890; 1-829 to L-890; Q-830 to L-890; P-831 to L-890; L- 
832 to L-890; L-833 to L-890; H-834 to L-890; A-835 to L-890; Q-836 to L-890; 

10 W-837 to L-890; V-838 to L-890; L-839 to L-890; G-840 to L-890; D-84 1 to L- 

890; W-842 to L-890; S-843 to L-890; E-844 to L-890; C-845 to L-890; S-846 
to L-890; S-847 to L-890; T-848 to L-890; C-849 to L-890; G-850 to L-890; A- 
851 to L-890; G-852 to L-890; W-853 to L-890; Q-854 to L-890; R-855 to L- 
890; R-856 to L-890; T-857 to L-890; V-858 to L-890; E-859 to L-890; C-860 

15 to L-890; R-861 to L-890; D-862 to L-890; P-863 to L-890; S-864 to L-890; G- 

865 to L-890; Q-866 to L-890; A-867 to L-890; S-868 to L-890; A-869 to L-890; 
T-870 to L-890; C-871 to L-890; N-872 to L-890; K-873 to L-890; A-874 to L- 
890; L-875 to L-890; K-876 to L-890; P-877 to L-890; E-878 to L-890; D-879 
to L-890; A-880 to L-890; K-881 to L-890; P-882 to L-890; C-883 to L-890; E- 

20 884 to L-890; S-885 to L-890; of SEQ ED NO:4. 

Moreover, C-terminal deletions of the METH2 polypeptide can also be 
described by the general formula 1-n, where n is an integer from 2 to 890 where 
n corresponds to the position of amino acid residue identified in SEQ ED NO:4. 
Preferably, C-terminal deletions of the METH2 polypeptide of the invention 

25 shown as SEQ ED NO :4 include polypeptides comprising the amino acid sequence 

of residues: M-l to P-889; M-l to C-888; M-l to L-887; M-l to Q-886; M-l to 
S-885; M-l to E-884; M-l to C-883; M-l to P-882; M-l to K-881; M-l to A- 
880; M-l to D-879; M-l to E-878; M-l to P-877; M-l to K-876; M-l to L-875; 
M-l to A-874; M-l to K-873; M-l to N-872; M-l to C-871; M-l to T-870; M-l 

30 to A-869; M-l to S-868; M-l to A-867; M-l to Q-866; M-l to G-865; M-l to S- 
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864; M-l to P-863; M-l to D-862; M-l to R-861; M-l to C-860; M-l to E-859; 
M-l to V-858; M-l to T-857; M-l to R-856; M-l to R-855; M-l to Q-854; M-l 
to W-853; M-l to G-852; M-l to A-851; M-l to G-850; M-l to C-849; M-l to 
T-848; M-l to S-847; M-l to S-846; M-l to C-845; M-l to E-844; M-l to S-843; 
5 M-l toW-842;M-l toD-841;M-l toG-840;M-l toL-839;M-l toV-838;M-l 

to W-837; M-l to Q-836; M-l to A-835; M-l to H-834; M-l to L-833; M-l to 
L-832; M-l to P-83 1 ; M-l to Q-830; M-l to 1-829; M-l to 1-828; M-l to N-827; 
M-l to T-826; M-l to T-825; M-l to A-824; M-l to R-823; M-l to E-822; M-l 
toK-821;M-l toS-820;M-l toS-819;M-l toQ-818;M-l toM-817;M-l toS- 

10 816; M-l toF-815;M-l toD-814;M-l toV-813;M-l toD-812;M-l toN-811; 

M-l to P-810; M-l to V-809; M-l to F-808; M-l to F-807; M-l to T-806; M-l 
to Y-805;M-1 toK-804;M-l to V-803;M-1 toK-802;M-l toP-801;M-l toP- 
800; M-l to F-799; M-l to V-798; M-l to E-797; M-l to G-796; M-l to P-795; 
M-l to V-794; M-l to T-793; M-l to L-792; M-l to L-791; M-l to Q-790; M-l 

15 to V-789; M-l to T-788; M-l to L-787; M-l to P-786; M-l to E-785; M-l to P- 

784; M-l to L-783; M-l to P-782; M-l to R-781; M-l to F-780; M-l to S-779; 
M-l to Q-778; M-l to L-777; M-l to R-776; M-l to E-775; M-l to L-774; M-l 
to T-773; M-l to A-772; M-l to 1-771; M-l to S-770; M-l to G-769; M-l to S- 
768; M-l to Y-767; M-l to K-766; M-l to L-765; M-l to 1-764; M-l to T-763; 

20 M-l to G-762; M-l to K-761; M-l to V-760; M-l to L-759; M-l to 1-758; M-l 

to D-757; M-l to Q-756; M-l to E-755; M-l to 1-754; M-l to A-753; M-l to S- 
752; M-l to 1-751; M-l to A-750; M-l to L-749; M-l to N-748; M-l to G-747; 
M-l to N-746; M-l to L-745; M-l to L-744; M-l to Y-743; M-l to Q-742; M-l 
to G-741; M-l to D-740; M-l to A-739; M-l to T-738- M-l to K-737; M-l to 

25 L-736; M-l to A-735; M-l to L-734; M-l to Y-733; M-l to N-732; M-l to G- 

73 1; M-l to D-730; M-l to N-729; M-l to Q-728; M-l to V-727; M-l to G-726; 
M-l to P-725; M-l to H-724; M-l to S-723; M-l to R-722; M-l to Q-721; M-l 
to K-720; M-l to V-719; M-l to D-718; M-l to 1-717; M-l to N-716; M-l to T- 
715; M-l to A-714; M-l to G-713; M-l to A-712; M-l to P-71 1; M-l to 1-710; 

30 M-l to T-709; M-l to V-708; M-l to 1-707; M-l to D-706; M-l to N-705; M-l 
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to Y-704; M-l to G-703; M-l to Y-702; M-l to N-701; M-l to T-700; M-l to 
P-699; M-l to T-698; M-l to L-697; M-l to S-696; M-l to G-695; M-l to S-694; 
M-l to V-693; M-l to K-692; M-l to R-691; M-l to C-690; M-l to S-689; M-l 
to N-688; M-l to G-687; M-l to K-686; M-l to G-685; M-l to G-684; M-l to 
5 C-683; M-l to V-682; M-l to G-681; M-l to C-680; M-l to K-679; M-l to D- 

678; M-l to L-677; M-l to K-676; M-l to R-675; M-l to P-674; M-l to S-673; 
M-l to D-672; M-l to V-671; M-l to V-670; M-l to H-669; M-l to D-668; M-l 
to C-667; M-l to G-666; M-l to A-665; M-l to K-664; M-l to V-663; M-l to 
C-662; M-l to Q-661; M-l to G-660; M-l to R-659; M-l to V-658; M-l to C- 

10 657; M-l to 1-656; M-l to A-655; M-l to L-654; M-l to T-653; M-l to E-652; 

M-l to P-651; M-l to G-650; M-l to C-649; M-l to L-648; M-l to T-647; M-l 
to G-646; M-l to D-645; M-l to 1-644; M-l to V-643; M-l to K-642; M-l to A- 
641; M-l to E-640; M-l to F-639; M-l to V-638; M-l to K-637; M-l to F-636; 
M-l to E-635; M-l to S-634; M-l to R-633; M-l to G-632; M- 1 to R-63 1 ; M-l 

15 to A-630; M-l to R-629; M-l to C-628; M-l to F-627; M-l to L-626; M-l to K- 

625; M-l to C-624; M-l to R-623; M-l to D-622; M-l to R-621; M-l to P-620; 
M-l to S-619; M-l to V-618; M-l to G-617; M-l to A-616; M-l to Y-615; M-l 
to K-614; M-l to P-613; M-l to V-612; M-l to W-61 1; M-l to Q-610; M-l to 
L-609; M-l to L-608; M-l to N-607; M-l to G-606; M-l to D-605; M-l to M- 

20 604; M-l to D-603; M-l to T-602; M-l to Y-601; M-l to N-600; M-l to Y-599; 

M-l to A-598; M-l to N-597; M-l to Y-596; M-l to K-595; M-l to E-594; M-l 
to C-593; M-l to Q-592; M-l to Q-591; M-l to E-590; M-l to R-589; M-l to F- 
588; M-l to S-587; M-l to K-586; M-l to G-585; M-l to D-584; M-l to P-583; 
M-l to P-582; M-l to C-581 ; M-l to E-580; M-l to E-579; M-l to T-578; M-l 

25 to H-577; M-l to C-576; M-l to S-575; M-l to Q-574; M-l to Y-573; M-l to K- 

572; M-l to A-571; M-l to R-570; M-l to R-569; M-l to G-568; M-l to L-567; 
M-l to C-566; M-l to Y-565; M-l to R-564; M-l to G-563; M-l to G-562; M-l 
to N-561; M-l to Q-560; M-l to P-559; M-l to E-558; M-l to P-557; M-l to D- 
556; M-l to K-555; M-l to C-554; M-l to E-553; M-l to R-552; M-l to H-551; 

30 M-l to S-550; M-l to F-549; M-l to Q-548; M-l to V-547; M-l to G-546; M-1 
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to G-545; M- 1 to G-544; M- 1 to C-543 ; M-l to T-542; M- 1 to R-54 1 ; M- 1 to S- 
540; M l to C-539; M-l toE-538; M-l to G-537; M-l to W-536; M-l to P-535; 
M-l toG-534;M-l toW-533;M-l toP-532;M-l to A-531;M-1 toW-530;M-l 
to G-529; M-l to G-528; M-l to D-527; M-l to V-526; M-l to V-525; M-l to 
P-524; M-l to K-523; M-l to P-522; M-l to R-521; M-l to E-520; M-l to V- 
519; M-l to E-518; M-l to E-517; M-l to E-516; M-l to P-515; M-l to L-514; 
M-l to C-513; M-l to S-512; M-l to G-51 1; M-l to E-510; M-l to S-509; M-l 
to C-508; M-l to L-507; M-l to H-506; M-l to G-505; M-l to P-504; M-l to G- 
503; M-l to C-502; M-l to P-501; M-l to T-500; M-l to G-499; M-l to D-498; 
M-l to A-497; M-l to W-496; M-l to P-495; M-l to L-494; M-l to S-493; M-l 
to G-492; M-l to N-491; M-l to K-490; M-l to T-489; M-l to H-488; M-l to • 
C-487; M-l to L-486; M-l to P-485; M-l to E-484; M-l to A-483; M-l to G- 
482; M-l toD-481;M-l toT-480;M-l toH-479;M-l toC-478;M-l toW-477; 
M-l to L-476; M-l to Q-475; M-l to A-474; M-l to C-473; M-l to V-472; M-l 
to D-471; M-l to Q-470; M-l to A-469; M-l to S-468; M-l to T-467; M-l to N- 
466; M-l to P-465; M-l to C-464; M-l to H-463; M-l to R-462; M-l to F-461; 
M-l to D-460; M-l to P-459; M-l to G-458; M-l to F-457; M-l to 1-456; M-l 
to Q-455; M-l to R-454; M-l to C-453; M-l to Q-452; M-l to Q-451; M-l to 
D-450; M-l to L449; M-l to Q-448; M-l to Y-447; M-l to 1^146; M-l to A- 
445;M-1 toM-444;M-l toR-443;M-l toG-442;M-l toP-441;M-l toL-440; 
M-l to G-439; M-l to T-438; M-l to P-437; M-l to L-436; M-l to P-435; M-l 
to L-434; M-l to A-433; M-l to A-432; M-l to G-43 1 ; M-l to P-430; M-l to A- 
429; M-l to D-428; M-l to L-427; M-l to L-426; M-l to C-425; M-l to D-424; 
M-l to G-423; M-l to H-422; M-l to G-421 ; M-l to G-420; M-l to D-419; M-l 
to L418; M-l to L-417; M-l to E-416; M-l to T-415; M-l to L-414; M-l to Y- 
413; M-l to M-412; M-l to A-41 1; M-l to S-410; M-l to C-409; M-l to P-408; 
M-l to S-407; M-l to W-406; M-l to P-405; M-l to L-404; M-l to T-403; M-l 
to Q-402; M-l to N-401; M-l to L-400; M-l to H-399; M-l to V-398; M-l to 
F-397; M-l to L-396; M-l to P-395; M-l to A-394; M-l to M-393; M-l to V- 
392; M-l toH-391;M-l toH-390;M-l toK-389;M-l toG-388;M-l toM-387; 
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M-l to P-386; M-l to G-385; M-l to F-384; M-l to L-383; M-l to R-382; M-l 
to T-381; M-l to C-380; M-l to P-379; M-l to K-378; M-l to S-377; M-l to D- 
376; M-l to D-375; M-l to H-374; M-l to P-373; M-l to M-372; M-l to S-371; 
M-l to L-370; M-l to V-369; M-l to H-368; M-l to G-367; M-l to L-366; M-l 
to E-365; M-l to H-364; M-l to A-363; M-l to L-362; M-l to T-361; M-l to H- 
360; M-l to A-359; M-l to A-358; M-l to Q-357; M-l to L-356; M-l to G-355; 
M-l to E-354; M-l to D-353; M-l to E-352; M-l to 1-351; M-l to V-350; M-l 
to S-349; M-l to C-348; M-l to S-347; M-l to K-346; M-l to N-345; M-l to P- 
344; M-l to D-343; M-l to C-342; M-l to 1-341; M-l to T-340; M-l to G-339; 
M-l to 1-338; M-l to D-337; M-l to A-336; M-l to V-335; M-l to G-334; M-l 
to L-333; M-l to T-332; M-l to D-33 1; M-l to C-330; M-l to L-329; M-l to G- 
328; M-l to E-327; M-l to Q-326; M-l to G-325; M-l to C-324; M-l to F-323; 
M-l toN-322;M-l toQ-321;M-l toR-320;M-l toT-319; M-l toL-318;M-l 
to L-317; M-l to 1-316; M-l to A-315; M-l to T-314; M-l to D-313; M-l to Y- 
312; M-l toH-311;M-l toE-310;M-l toP-309; M-l toH-308;M-l toR-307; 
M-l to D-306; M-l to S-305; M-l to P-304; M-l to Q-303; M-l to N-302; M-l 
to F-301; M-l to R-300; M-l to R-299; M-l to Q-298; M-l to W-297; M-l to 
N-296; M-l to C-295; M-l to F-294; M-l to N-293; M-l to R-292; M-l to L- 
291; M-l to T-290; M-l to L-289; M-l to G-288; M-l to G-287; M-l to N-286; 
M-l to D-285; M-l to S-284; M-l to V-283; M-l to E-282; M-l to P-281; M-l 
to G-280; M-l to W-279; M-l to K-278; M-l to E-277; M-l to D-276; M-l to 
E-275; M-l to V-274; M-l to 1-273; M-l to L-272; M-l to V-271; M-l to K- 
270; M-l to V-269;M-ltoV-268;M-l toM-267;M-l toL-266;M-l toN-265; 
M-l to 1-264; M-l to S-263; M-l to N-262; M-l to K-261; M-l to 1-260; M-l 
to S-259; M-l to P-258; M-l to H-257; M-l to K-256; M-l to Y-255; M-l to I- 
254; M-l to R-253; M-l to A-252; M-l to A-251; M-l to V-250; M-l to S-249; 
M-l to M-248; M-l to L-247; M-l to T-246; M-l to L-245; M-l to 1-244; M-l 
to H-243; M-l to N-242; M-l to Q-241; M-l to L-240; M-l to D-239; M-l to 
A-238; M-l to G-237; M-l to Y-236; M-l to F-235; M-l to A-234; M-l to A- 
233; M-l to M-232; M-l to S-23 1 ; M-l to A-230; M-l to D-229; M-l to A-228; 
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M-l to V-227; M-l to L-226; M-l to L-225; M-l to T-224; M-l to E-223; M-l 
to V-222; M-l to F-221; M-l to R-220; M-l to A-219; M-l to E-21 8; M-l to S- 
217; M-l to V-216; M-l to F-215; M-l to R-214; M-l to K-213; M-l to T-212; 
M-l to R-21 1; M-l to S-210; M-l to T-209; M-l to A-208; M-l to G-207; M-l 
5 to L-206; M-l to P-205; M-l to P-204; M-l to P-203; M-l to P-202; M-l to E- 

201; M-l to S-200; M-l to A-199; M-l to G-198; M-l to E-197; M-l to A-196; 
M-l to E-195; M-l to E-194; M-l to E-193; M-l to Q-192; M-l to S-191; M-l 
to E-190; M-l toE-189; M-l to E-188; M-l to S-187; M-l to D-186; M-l to E- 
185;M-1 toQ-184;M-l toH-183;M-l toD-182;M-l toG-181;M-l toR-180; 

10 M-l to E-179; M-l to Q-178; M-l to R-177; M-l to Q-176; M-l to G-175; M-l 

to E-174; M-l to G-173; M-l to T-172; M-l to E-171; M-l to V-170; M-l to E- 
169; M-l toW-168;M-ltoE-167;M-l toP-166;M-l to G- 165; M-l toR-164; 
M-l to P-163; M-l to L-162; M-l to P-161; M-l to R-160; M-l to A-159; M-l 
to G-158; M-l to A-157; M-l to P-156; M-l to G-155; M-l to W-154; M-l to 

15 R-153; M-l to Q-152; M-l to L-151; M-l to R-150; M-l to H-149; M-l to P- 

148; M-l toQ-147;M-l toA-146;M-l toL-145;M-l to S-144; M-l toG-143; 
M-l to G-142; M-l to A-141; M-l to G-140; M-l to Q-139; M-l to P-138; M-l 
to Q-137; M-l to 1-136; M-l to T-135; M-l to F-134; M-l to E-133; M-l to E- 
132; M-l to G-131; M-l to D-130; M-l to L-129; M-l to L-128; M-l to F-127; 

20 M-l to S-126; M-l to G-125; M-l to S-124; M-l to L-123; M-l to G-122; M-l 

to R-121; M-l to C-120; M-l to L-119; M-l to S-l 18; M-l to V-l 17; M-l to A- 
116; M-l to A-l 15; M-l to L-l 14; M-l to S-l 13; M-l to E-l 12; M-l to P-l 1 1; 
M-l to E-l 10; M-l to G-109; M-l to N-108; M-l to V-107; M-l to T-106; M-l 
toG-105;M-l toS-104;M-l toF-103;M-l toF-102; M-l toC-101;M-l toG- 

25 100; M-l to R-99; M-l to L-98; M-l to G-97; M-l to R-96; M-l to E-95; M-l 

to G-94; M-l to G-93; M-l to T-92; M-l to A- 91; M-l to R-90; M-l to G-89; 
M-l to S-88; M-l to G-87; M-l to G-86; M-l to L-85; M-l to R-84; M-l to E- 
83; M-l to 1-82; M-l to K-81; M-l to F-80; M-l to E-79; M-l to P-78; M-l to 
A-77; M-l to L-76; M-l to F-75; M-l to S-74; M-l to D-73; M-l to D-72; M-l 

30 to P-71 ; M-l to A-70; M-l to L-69; M-l to R-68; M-l to L-67; M-l to V-66; M- 
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1 to F-65; M-l to G-64; M-l to K-63; M-l to G-62; M-l to F-61; M-l to A-60; 
M-l to S-59; M-l to L-58;,M-1 to H-57; M-l to L-56; M-l to A-55; M-l to L- 
54; M-l to E-53; M-l to G-52; M-l to A-5 1 ; M-l to S-50; M-l to G-49; M-l to 
P-48; M-l to L-47; M-l to R-46; M-l to T-45; M-l to P-44; M-l to V-43; M-l 
5 to V-42; M-l to L-41; M-l to E-40; M-l to S-39; M-l to A-38; M-l to Q-37; M- 

1 to G-36; M-l to G-35; M-l to A-34; M-l to A-33; M-l to P-32; M-l to R-31; 
M-l to A-30; M-l to P-29; M-l to A-28; M-l to G-27; M-l to R-26; M-l to A- 
25; M-l to L-24; M-l to P-23; M-l to L-22; M-l to L-21; M-l to L-20; M-l to 
L-19; M-l to L-18; M-l to L-l 7; M-l to L-16; M-l to L-15; M-l to L-14; M-l 
10 to F-13; M-l to P-12; M-l to L-l 1; M-l to W-10; M-l to R-9; M-l to P-8; M-l 

to A-7; of SEQ ID NO:4. Preferably, any of the above listed N- or C-terminal 
deletions can be combined to produce a N- and C-terminal deleted METH2 
polypeptide. 

The invention also provides polypeptides having one or more amino acids 

15 deleted from both the amino and the carboxyl termini, which may be described 

generally as having residues m-n of SEQ ID NO:2 or SEQ ID NO:4, where n and 
m are integers as described above. 

Also preferred are METH1 or METH2 polypeptide and polynucleotide 
fragments characterized by structural or functional domains. Preferred 

20 embodiments of the invention include fragments that comprise alpha-helix and 

alpha-helix forming regions ("alpha-regions"), beta-sheet and beta-sheet-forming 
regions ("beta-regions"), turn and turn-forming regions ("turn-regions"), coil and 
coil-forming regions ("coil-regions"), hydrophilic regions, hydrophobic regions, 
alpha amphipathic regions, beta amphipathic regions, flexible regions, surface- 

25 forming regions* substrate binding region, and high antigenic index regions. As 

set out in the Figures, such preferred regions include Garnier-Robson alpha- 
regions, beta-regions, turn-regions, and coil-regions, Chou-Fasman alpha-regions, 
beta-regions, and turn-regions, Kyte-Doolittle hydrophilic regions and 
hydrophobic regions, Eisenberg alpha and beta amphipathic regions, Karplus- 

30 Schulz flexible regions, Emini surface-forming regions, and Jameson-Wolf high 
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antigenic index regions. Polypeptide fragments of SEQ ID NO: 2 falling within 
conserved domains are specifically contemplated by the present invention. (See 
Figures 10 & 1 1 and Tables 1& 2.) Moreover, polynucleotide fragments encoding 
these domains are also contemplated. 

Other preferred fragments are biologically active METH1 or METH2 
fragments. Biologically active fragments are those exhibiting activity similar, but 
not necessarily identical, to an activity of the METH1 or METH2 polypeptide. 
The biological activity of the fragments may include an improved desired activity, 
or a decreased undesirable activity. 

However, many polynucleotide sequences, such as EST sequences, are 
publicly available and accessible through sequence databases. Some of these 
sequences are related to SEQ ID NO:l or SEQ ID NO:3 and may have been 
publicly available prior to conception of the present invention. Preferably, such 
related polynucleotides are specifically excluded from the scope of the present 
invention. To list every related sequence would be cumbersome. Accordingly, 
preferably excluded from the present invention are one or more polynucleotides 
comprising a nucleotide sequence described by the general formula of a-b, where 
a is any integer between 1 to 936 of SEQ ID NO: 1 , b is an integer of 1 5 to 950, 
where both a and b correspond to the positions of nucleotide residues shown in 
SEQ ID NO:l, and where the b is greater than or equal to a + 14. Moreover, 
preferably excluded from the present invention are one or more polynucleotides 
comprising a nucleotide sequence described by the general formula of a-b, where 
a is any integer between 1 to 876 of SEQ ID NO:3, b is an integer of 1 5 to 890, 
where both a and b correspond to the positions of nucleotide residues shown in 
SEQ ID NO:3, and where the b is greater than or equal to a + 14. 

Epitopes & Antibodies 

In the present invention, "epitopes" refer to METH1 or METH2 
polypeptide fragments having antigenic or immunogenic activity in an animal, 
especially in a human. A preferred embodiment of the present invention relates 
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to a METH1 or METH2 polypeptide fragment comprising an epitope, as well as 
the polynucleotide encoding this fragment. A region of a protein molecule to 
which an antibody can bind is defined as an "antigenic epitope." In contrast, an 
"immunogenic epitope" is defined as a part of a protein that elicits an antibody 
response. (See, for instance, Geysen etal., Proc. Natl. Acad. Sci. USA 57:3998- 
4002(1983).) 

Fragments which function as epitopes may be produced by any 
conventional means, (See, e.g., Houghten, R. A., Proc. Natl. Acad Sci. USA 
52:5131-5135 (1985) further described in U.S. Patent No. 4,631,21 1.) 

In the present invention, antigenic epitopes preferably contain a sequence 
of at least seven, more preferably at least nine, and most preferably between about 
15 to about 30 amino acids. Antigenic epitopes are useful to raise antibodies, 
including monoclonal antibodies, that specifically bind the epitope. (See, for 
instance, Wilson et aL, Cell 37:767-778 (1984); Sutcliffe, J. G. el al., Science 
27P:660-666 (1983).) 

Similarly, immunogenic epitopes can be used to induce antibodies 
according to methods well known in the art. (See, for instance, Sutcliffe et al. , 
supra; Wilson e/ a/., supra; Chow, M etal. y Proc. Nail. Acad Sci. USA £2:910- 
914; and Bittle, F. J. etal. y J. Gen. Virol 66:2347-2354 (1985).) A preferred 
immunogenic epitope includes the secreted protein. The immunogenic epitopes 
may be presented together with a carrier protein, such as an albumin, to an animal 
system (such as rabbit or mouse) or, if it is long enough (at least about 25 amino 
acids), without a carrier. However, immunogenic epitopes comprising as few as 
8 to 10 amino acids have been shown to be sufficient to raise antibodies capable 
of binding to, at the very least, linear epitopes in a denatured polypeptide (e.g., in 
Western blotting.) 

Using DNAstar analysis, SEQ ED NO: 2 was found antigenic at amino 
acids: 2-14,32-44,47-60,66-78,87-103, 109-118, 146-162, 168-180, 183-219, 
223-243, 275-284, 296-306, 3 14-334, 341-354, 357-376, 392-399, 401-41 0, 418- 
429, 438-454, 456-471, 474-488, 510-522, 524-538, 550-561, 565-626, 630-643, 
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659-671, 679-721, 734-749, 784-804, 813-820, 825-832, 845-854, 860-894, 899- 
917, 919-924 and 928-939. Thus, these regions could be used as epitopes to 
produce antibodies against the protein encoded by METH1 cDNA. 

Using DNAstar analysis, SEQ ID NO:4 was found antigenic at amino 
acids: 26-38, 45-52, 69-76, 80-99, 105-113, 129-136, 138-217, 254-263, 273- 
289,294-313,321-331,339-356,371-383,417-427,438-443,459-471,479-505, 
507-526, 535-546, 550-607, 615-640, 648-653, 660-667, 669-681, 683-704, 717- 
732, 737-743, 775-787, 797-804, 811-825, 840-867 and 870-884. Thus, these 
regions could be used as epitopes to produce antibodies against the protein 
encoded by METH2 cDNA. 

As used herein, the term "antibody" (Ab) or "monoclonal antibody" (Mab) 
is meant to include intact molecules as well as antibody fragments (such as, for 
example, Fab and F(ab*)2 fragments) which are capable of specifically binding to 
protein. Fab and F(ab')2 fragments lack the Fc fragment of intact antibody, clear 
more rapidly from the circulation, and may have less non-specific tissue binding 
than an intact antibody. (Wahl et al, 1 Nucl. Med 24:316-325 (1983).) Thus, 
these fragments are preferred, as well as the products of a FAB or other 
immunoglobulin expression library. Moreover, antibodies of the present invention 
include chimeric, single chain, and humanized antibodies. 

Fusion Proteins 

Any METH1 or METH2 polypeptide can be used to generate fusion 
proteins. For example, the METH1 or METH2 polypeptide, when fused to a 
second protein, can be used as an antigenic tag. Antibodies raised against the 
METH1 or METH2 polypeptide can be used to indirectly detect the second 
protein by binding to the METH1 or METH2. Moreover, because secreted 
proteins target cellular locations based on trafficking signals, the METH1 or 
METH2 polypeptides can be used as a targeting molecule once fused to other 
proteins. 
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Examples of domains that can be fused to METH1 or METH2 
polypeptides include not only heterologous signal sequences, but also other 
heterologous functional regions. The fusion does not necessarily need to be 
direct, but may occur through linker sequences. 

Moreover, fusion proteins may also be engineered to improve 
characteristics of the METH1 or METH2 polypeptide. For instance, a region of 
additional amino acids, particularly charged amino acids, may be added to the N- 
terminus of the METH1 or METH2 polypeptide to improve stability and 
persistence during purification from the host cell or subsequent handling and 
storage. Also, peptide moieties may be added to the METH1 or METH2 
polypeptide to facilitate purification. Such regions may be removed prior to final 
preparation of the METH1 or METH2 polypeptide. The addition of peptide 
moieties to facilitate handling of polypeptides are familiar and routine techniques 
in the art. 

Moreover, METH1 or METH2 polypeptides, including fragments, and 
specifically epitopes, can be combined with parts of the constant domain of 
immunoglobulins (IgG), resulting in chimeric polypeptides. These fusion proteins 
facilitate purification and show an increased half-life in vivo. One reported 
example describes chimeric proteins consisting of the first two domains of the 
human CD4-polypeptide and various domains of the constant regions of the heavy 
or light chains of mammalian immunoglobulins. (EP A 394,827; Traunecker et 
al. y Nature 337:84-86 (1988).) Fusion proteins having disulfide-linked dimeric 
structures (due to the IgG) can also be more efficient in binding and neutralizing 
other molecules, than the monomelic secreted protein or protein fragment alone. 
(Fountoulakis et al., J. Biochem. 270:3958-3964 (1995).) 

Similarly, EP-A-0 464 533 (Canadian counterpart 2045869) discloses 
fusion proteins comprising various portions of constant region of immunoglobulin 
molecules together with another human protein or part thereof. In many cases, 
the Fc part in a fusion protein is beneficial in therapy and diagnosis, and thus can 
result in, for example, improved pharmacokinetic properties. (EP-A 0232 262.) 
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Alternatively, deleting the Fc part after the fusion protein has been expressed, 
detected, and purified, would be desired. For example, the Fc portion may hinder 
therapy and diagnosis if the fusion protein is used as an antigen for immunizations. 
In drug discovery, for example, human proteins, such as hIL-5, have been fused 
with Fc portions for the purpose of high-throughput screening assays to identify 
antagonists of hIL-5. (See, D. Bennett et al., J. Molecular Recognition 8:52-58 
(1995); K. Johanson et al., J. Biol. Chem. 270:9459-9471 (1995).) 

Moreover, the METH1 or METH2 polypeptides can be fused to marker 
sequences, such as a peptide which facilitates purification of METH1 or METH2. 
In preferred embodiments, the marker amino acid sequence is a hexa-histidine 
peptide, such as the tag provided in a pQE vector (QIAGEN, Inc., 9259 Eton 
Avenue, Chatsworth, CA, 91311), among others, many of which are commercially 
available. As described in Gentz et al, Proc. Natl: Acad Scl USA 56:821-824 
(1989), for instance, hexa-histidine provides for convenient purification of the 
fusion protein. Another peptide tag useful for purification, the "HA" tag, 
corresponds to an epitope derived from the influenza hemagglutinin protein. 
(Wilson et al., Cell 37:767 (1984).) 

Thus, any of these above fusions can be engineered using the METH1 or 
METH2 polynucleotides or the polypeptides. 

Biological Activities ofMETHl or METH2 

METH1 or METH2 polynucleotides and polypeptides can be used in 
assays to test for one or more biological activities. If METH1 or METH2 
polynucleotides and polypeptides do exhibit activity in a particular assay, it is 
likely that METH 1 or METH2 may be involved in the diseases associated with the 
biological activity. Therefore, METH1 or METH2 could be used to treat the 
associated disease. 
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Immune Activity 

METH1 or METH2 polypeptides or polynucleotides may be useful in 
treating deficiencies or disorders of the immune system, by activating or inhibiting 
the proliferation, differentiation, or mobilization (chemotaxis) of immune cells. 
Immune cells develop through a process called hematopoiesis, producing myeloid 
(platelets, red blood cells, neutrophils, and macrophages) and lymphoid (B and T 
lymphocytes) cells from pluripotent stem cells. The etiology of these immune 
deficiencies or disorders may be genetic, somatic, such as cancer or some 
autoimmune disorders, acquired (e.g., by chemotherapy or toxins), or infectious. 
Moreover, METH1 or METH2 polynucleotides or polypeptides can be used as a 
marker or detector of a particular immune system disease or disorder. 

METH1 or METH2 polynucleotides, or polypeptides may be useful in 
treating or detecting deficiencies or disorders of hematopoietic cells. METH1 or 
METH2 polypeptides or polynucleotides could be used to increase differentiation 
and proliferation of hematopoietic cells, including the pluripotent stem cells, in an 
effort to treat those disorders associated with a decrease in certain (or many) types 
hematopoietic cells. Examples of immunologic deficiency syndromes include, but 
are not limited to: blood protein disorders (e.g. agammaglobulinemia, 
dysgammaglobulinemia), ataxia telangiectasia, common variable 
immunodeficiency, Digeorge Syndrome, HIV infection, HTLV-BLV infection, 
leukocyte adhesion deficiency syndrome, lymphopenia, phagocyte bactericidal 
dysfiinction, severe combined immunodeficiency (SCDDs), Wiskott-Aldrich 
Disorder, anemia, thrombocytopenia, or hemoglobinuria. 

Moreover, METH1 or METH2 polypeptides or polynucleotides can also 
be used to modulate hemostatic (the stopping of bleeding) or thrombolytic activity 
(clot formation). For example, by increasing hemostatic or thrombolytic activity, 
METH1 or METH2 polynucleotides or polypeptides could be used to treat blood 
coagulation disorders (e.g., afibrinogenemia, factor deficiencies), blood platelet 
disorders (e.g. thrombocytopenia), or wounds resulting from trauma, surgery, or 
other causes. Alternatively, METH1 or METH2 polynucleotides or polypeptides 
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that can decrease hemostatic or thrombolytic activity could be used to inhibit or 
dissolve clotting, important in the treatment of heart attacks (infarction), strokes, 
or scarring. 

METH1 or METH2 polynucleotides or polypeptides may also be useful 
in treating or detecting autoimmune disorders. Many autoimmune disorders result 
from inappropriate recognition of self as foreign material by immune cells. This 
inappropriate recognition results in an immune response leading to the destruction 
of the host tissue; Therefore, the administration of METH1 or METH2 
polypeptides or polynucleotides that can inhibit an immune response, particularly 
the proliferation, differentiation, or chemotaxis of T-cells, may be an effective 
therapy in preventing autoimmune disorders. 

Examples of autoimmune disorders that can be treated or detected by 
METH1 or METH2 include, but are not limited to: Addison's Disease, hemolytic 
anemia, antiphospholipid syndrome, rheumatoid arthritis, dermatitis, allergic 
encephalomyelitis, glomerulonephritis, Goodpasture's Syndrome, Graves 1 Disease, 
Multiple Sclerosis, Myasthenia Gravis, Neuritis, Ophthalmia, Bullous Pemphigoid, 
Pemphigus, Polyendocrinopathies, Purpura, Reiter's Disease, Stiff-Man Syndrome, 
Autoimmune Thyroiditis, Systemic Lupus Erythematosus, Autoimmune 
Pulmonary Inflammation, Guillain-Barre Syndrome, insulin dependent diabetes 
mellitis, and autoimmune inflammatory eye disease. 

Similarly, allergic reactions and conditions, such as asthma (particularly 
allergic asthma) or other respiratory problems, may also be treated by METH1 or 
METH2 polypeptides or polynucleotides. Moreover, METH1 or METH2 can be 
used to treat anaphylaxis, hypersensitivity to an antigenic molecule, or blood 
group incompatibility. 

METH1 or METH2 polynucleotides or polypeptides may also be used to 
treat and/or prevent organ rejection or graft-versus-host disease (GVHD). Organ 
rejection occurs by host immune cell destruction of the transplanted tissue through 
an immune response. Similarly, an immune response is also involved in GVHD, 
but, in this case, the foreign transplanted immune cells destroy the host tissues. 
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The administration of METH1 or METH2 polypeptides or polynucleotides that 
inhibits an immune response, particularly the proliferation, differentiation, or 
chemotaxis of T-cells, may be an effective therapy in preventing organ rejection 
or GVHD. 

Similarly, METH1 or METH2 polypeptides or polynucleotides may also 
be used to modulate inflammation. For example, METH1 or METH2 
polypeptides or polynucleotides may inhibit the proliferation and differentiation of 
cells involved in an inflammatory response. These molecules can be used to treat 
inflammatory conditicms, both chronic and acute conditions, including 
inflammation associated with infection (e.g., septic shock, sepsis, or systemic 
inflammatory response syndrome (SIRS)), ischemia-reperfusion injury, endotoxin 
lethality, arthritis, complement-mediated hyperacute rejection, nephritis, cytokine 
or chemokine induced lung injury, inflammatory bowel disease, Crohn's disease, 
or resulting from over production of cytokines (e.g., TNF or IL-1 .) 

Hyperproliferaiive Disorders 

METH1 or METH2 polypeptides or polynucleotides can be used to treat 
or detect hyperproliferative disorders, including neoplasms. METH1 or METH2 
polypeptides or polynucleotides may inhibit the proliferation of the disorder 
through direct or indirect interactions. Alternatively, METH1 or METH2 
polypeptides or polynucleotides may proliferate other cells which can inhibit the 
hyperproliferative disorder. 

For example, by increasing an immune response, particularly increasing 
antigenic qualities of the hyperproliferative disorder or by proliferating, 
differentiating, or mobilizing T-cells, hyperproliferative disorders can be treated. 
This immune response may be increased by either enhancing an existing immune 
response, or by initiating a new immune response. Alternatively, decreasing an 
immune response may also be a method of treating hyperproliferative disorders, 
such as a chemotherapeutic agent. 
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Examples of hyperproliferative disorders that can be treated or detected 
by METH1 or METH2 polynucleotides or polypeptides include, but are not 
limited to neoplasms located in the: abdomen, bone, breast, digestive system, liver, 
pancreas, peritoneum, endocrine glands (adrenal, parathyroid, pituitary, testicles, 
ovary, thymus, thyroid), eye, head and neck, nervous (central and peripheral), 
lymphatic system, pelvic, skin, soft tissue, spleen, thoracic, and urogenital. 

Similarly, other hyperproliferative disorders can also be treated or detected 
by METH1 or METH2 polynucleotides or polypeptides. Examples of such 
hyperproliferative disorders include, but are not limited to: 
hypergammaglobulinemia, lymphoproliferative disorders, paraproteinemias, 
purpura, sarcoidosis, Sezary Syndrome, Waldenstron's Macroglobulinemia, 
Gaucher f s Disease, histiocytosis, and any other hyperproliferative disease, besides 
neoplasia, located in an organ system listed above. 

Infectious Disease 

METH1 or METH2 polypeptides or polynucleotides can be used to treat 
or detect infectious agents. For example, by increasing the immune response, 
particularly increasing the proliferation and differentiation of B and/or T cells, 
infectious diseases may be treated. The immune response may be increased by 
either enhancing an existing immune response, or by initiating a new immune 
response. Alternatively, METH1 or METH2 polypeptides or polynucleotides may 
also directly inhibit the infectious agent, without necessarily eliciting an immune 
response. 

Viruses are one example of an infectious agent that can cause disease or 
symptoms that can be treated or detected by METH1 or METH2 polynucleotides 
or polypeptides. Examples of viruses, include, but are not limited to the following 
DNAand RNA viral families: Arbovirus, Adenoviridae, Arenaviridae, Arterivirus, 
Birnaviridae, Bunyaviridae, Caliciviridae, Circoviridae, Coronaviridae, 
Flaviviridae, Hepadnaviridae (Hepatitis), Herpesviridae (such as, 
Cytomegalovirus, Herpes Simplex, Herpes Zoster), Mononegavirus (e.g., 
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Paramyxoviridae, Morbillivirus, Rhabdoviridae), Orthomyxoviridae (e.g., 
Influenza), Papovaviridae, Parvoviridae, Picornaviridae, Poxviridae (such as 
Smallpox or Vaccinia), Reoviridae (e.g., Rotavirus), Retroviridae (HTLV-I, 
HTLV-II, Lentivirus), and Togaviridae (e.g., Rubivirus). Viruses falling within 
these families can cause a variety of diseases or symptoms, including, but not 
limited to: arthritis, bronchiollitis, encephalitis, eye infections (e.g., conjunctivitis, 
keratitis), chronic fatigue syndrome, hepatitis (A, B, C, E, Chronic Active, Delta), 
meningitis, opportunistic infections (e.g., AIDS), pneumonia, Burkitt's 
Lymphoma, chickenpox , hemorrhagic fever, Measles, Mumps, Parainfluenza, 
Rabies, the common cold, Polio, leukemia, Rubella, sexually transmitted diseases, 
skin diseases (e.g., Kaposi's, warts), and viremia. METH1 or METH2 
polypeptides or polynucleotides can be used to treat or detect any of these 
symptoms or diseases. 

Similarly, bacterial or fungal agents that can cause disease or symptoms 
and that can be treated or detected by METH1 or METH2 polynucleotides or 
polypeptides include, but not limited to, the following Gram-Negative and Gram- 
positive bacterial families and fungi: Actinomycetales (e.g., Corynebacterium, 
Mycobacterium, Norcardia), Aspergillosis, Bacillaceae (e.g., Anthrax, 
Clostridium), Bacteroidaceae, Blastomycosis, Bordetella, Borrelia, Brucellosis, 
Candidiasis, Campylobacter, Coccidioidomycosis, Cryptococcosis, 
Dermatocycoses, Enterobacteriaceae (Klebsiella, Salmonella, Serratia, Yersinia), 
Erysipelothrix, Helicobacter, Legionellosis, Leptospirosis, Listeria, 
Mycoplasmatales, Neisseriaceae (e.g., Acinetobacter, Gonorrhea, Menigococcal), 
Pasteurellacea Infections (e.g., Actinobacillus, Heamophilus, Pasteurella), 
Pseudomonas, Rickettsiaceae, Chlamydiaceae, Syphilis, and Staphylococcal. 
These bacterial or fungal families can cause the following diseases or symptoms, 
including, but not limited to: bacteremia, endocarditis, eye infections 
(conjunctivitis, tuberculosis, uveitis), gingivitis, opportunistic infections (e.g., 
AIDS related infections), paronychia, prosthesis-related infections, Reiter's 
Disease, respiratory tract infections, such as Whooping Cough or Empyema, 
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sepsis, Lyme Disease, Cat-Scratch Disease, Dysentery, Paratyphoid Fever, food 
poisoning, Typhoid, pneumonia, Gonorrhea, meningitis, Chlamydia, Syphilis, 
Diphtheria, Leprosy, Paratuberculosis, Tuberculosis, Lupus, Botulism, gangrene, 
tetanus, impetigo, Rheumatic Fever, Scarlet Fever, sexually transmitted diseases, 
5 skin diseases (e.g., cellulitis, dermatocy coses), toxemia, urinary tract infections, 

wound infections. METH1 or METH2 polypeptides or polynucleotides can be 
used to treat or detect any of these symptoms or diseases. 

Moreover, parasitic agents causing disease or symptoms that can be 
treated or detected by METH1 or METH2 polynucleotides or polypeptides 

10 include, but not limited to, the following families: Amebiasis, Babesiosis, 

Coccidiosis, Cryptosporidiosis, Dientamoebiasis, Dourine, Ectoparasitic, 
Giardiasis, Helminthiasis, Leishmaniasis, Theileriasis, Toxoplasmosis, 
Trypanosomiasis, and Trichomonas. These parasites can cause a variety of 
diseases or symptoms, including, but not limited to: Scabies, Trombiculiasis, eye 

15 infections, intestinal disease (e.g., dysentery, giardiasis), liver disease, lung disease, 

opportunistic infections (e.g., AIDS related), Malaria, pregnancy complications, 
and toxoplasmosis. METH1 or METH2 polypeptides or polynucleotides can be 
used to treat or detect any of these symptoms or diseases. 

Preferably, treatment using METH1 or METH2 polypeptides or 

20 polynucleotides could either be by administering an effective amount of METH1 

or METH2 polypeptide to the patient, or by removing cells from the patient, 
supplying the cells with METH1 or METH2 polynucleotide, and returning the 
engineered cells to the patient (ex vivo therapy). Moreover, the METH1 or 
METH2 polypeptide or polynucleotide can be used as an antigen in a vaccine to 

25 raise an immune response against infectious disease. 

Regeneration 

METH1 or METH2 polynucleotides or polypeptides can be used to 
differentiate, proliferate, and attract cells, leading to the regeneration of tissues. 
(See, Science 276:59-87 (1997).) The regeneration of tissues could be used to 
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repair, replace, or protect tissue damaged by congenital defects, trauma (wounds, 
burns, incisions, or ulcers), age, disease (e.g. osteoporosis, osteocarthritis, 
periodontal disease, liver failure), surgery, including cosmetic plastic surgery, 
fibrosis, reperfusion injury, or systemic cytokine damage. 

Tissues that could be regenerated using the present invention include 
organs (e.g., pancreas, liver, intestine, kidney, skin, endothelium), muscle (smooth, 
skeletal or cardiac), vascular (including vascular endothelium), nervous, 
hematopoietic, and skeletal (bone, cartilage, tendon, and ligament) tissue. 
Preferably, regeneration occurs without or decreased scarring. Regeneration also 
may include angiogenesis. 

Moreover, METH1 or METH2 polynucleotides or polypeptides may 
increase regeneration of tissues difficult to heal. For example, increased 
tendon/ligament regeneration would quicken recovery time after damage. METH 1 
or METH2 polynucleotides or polypeptides of the present invention could also be 
used prophylactically in an effort to avoid damage. Specific diseases that could 
be treated include of tendinitis, carpal tunnel syndrome, and other tendon or 
ligament defects. A further example of tissue regeneration of non-healing wounds 
includes pressure ulcers, ulcers associated with vascular insufficiency, surgicaL, and 
traumatic wounds. 

Similarly, nerve and brain tissue could also be regenerated by using 
METH1 or METH2 polynucleotides or polypeptides to proliferate and 
differentiate nerve cells. Diseases that could be treated using this method include 
central and peripheral nervous system diseases, neuropathies, or mechanical and 
traumatic disorders (e.g., spinal cord disorders, head trauma, cerebrovascular 
disease, and stoke). Specifically, diseases associated with peripheral nerve 
injuries, peripheral neuropathy (e.g., resulting from chemotherapy or other medical 
therapies), localized neuropathies, and central nervous system diseases (e.g., 
Alzheimer's disease, Parkinson's disease, Huntington's disease, amyotrophic lateral 
sclerosis, and Shy-Drager syndrome), could all be treated using the METH1 or 
METH2 polynucleotides or polypeptides. 
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Chemotaxis 

METH 1 or METH2 polynucleotides or polypeptides may have chemotaxis 
activity- A chemotaxic molecule attracts or mobilizes cells (e.g., monocytes, 
fibroblasts, neutrophils, T-cells, mast cells, eosinophils, epithelial and/or 
endothelial cells) to a particular site in the body, such as inflammation, infection, 
or site of hyperproliferation. The mobilized cells can then fight off and/or heal the 
particular trauma or abnormality. 

METH1 or METH2 polynucleotides or polypeptides may increase 
chemotaxic activity of particular cells. These chemotactic molecules can then be 
used to treat inflammation, infection, hyperproliferative disorders, or any immune 
system disorder by increasing the number of cells targeted to a particular location 
in the body. For example, chemotaxic molecules can be used to treat wounds and 
other trauma to tissues by attracting immune cells to the injured location. As a 
chemotactic molecule, METH1 or METH2 could also attract fibroblasts, which 
can be used to treat wounds. 

It is also contemplated that METH1 or METH2 polynucleotides or 
polypeptides may inhibit chemotactic activity. These molecules could also be used 
to treat disorders. Thus, METH1 or METH2 polynucleotides or polypeptides 
could be used as an inhibitor of chemotaxis. 

Binding Activity 

METH 1 or METH2 polypeptides may be used to screen for molecules that 
bind to METH1 or METH2 or for molecules to which METH1 or METH2 binds. 
The binding of METH 1 or METH2 and the molecule may activate (agonist), 
increase, inhibit (antagonist), or decrease activity of the METH1 or METH2 or 
the molecule bound. Examples of such molecules include antibodies, 
oligonucleotides, proteins (e.g., receptors),or small molecules. 

Preferably, the molecule is closely related to the natural ligand of METH 1 
or METH2, e.g., a fragment of the ligand, or a natural substrate, a ligand, a 
structural or functional mimetic. (See, Coligan et al. y Current Protocols in 
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Immunology 7#):Chapter 5 (1991).) Similarly, the molecule can be closely 
related to the natural receptor to which METH1 or METH2 binds, or at least, a 
fragment of the receptor capable of being bound by METH1 or METH2 (e.g., 
active site). In either case, the molecule can be rationally designed using known 
techniques. 

Preferably, the screening for these molecules involves producing 
appropriate cells which express METH1 or METH2, either as a secreted protein 
or on the cell membrane. Preferred cells include cells from mammals, yeast, 
Drosophila, or E. coli. Cells expressing METH1 or METH2(or cell membrane 
containing the expressed polypeptide) are then preferably contacted with a test 
compound potentially containing the molecule to observe binding, stimulation, or 
inhibition of activity of either METH1 or METH2 or the molecule. 

The assay may simply test binding of a candidate compound toMETHl or 
METH2, wherein binding is detected by a label, or in an assay involving 
competition with a labeled competitor. Further, the assay may test whether the 
candidate compound results in a signal generated by binding to METH1 or 
METH2. 

Alternatively, the assay can be carried out using cell-free preparations, 
polypeptide/molecule affixed to a solid support, chemical libraries, or natural 
product mixtures. The assay may also simply comprise the steps of mixing a 
candidate compound with a solution containing METH1 or METH2, measuring 
METH1 or METH2/molecule activity or binding, and comparing the METH1 or 
METH2/molecule activity or binding to a standard. 

Preferably, an ELISA assay can measure METH1 or METH2 level or 
activity in a sample (e.g., biological sample) using a monoclonal or polyclonal 
antibody. The antibody can measure METH1 or METH2 level or activity by 
either binding, directly or indirectly, to METH1 or METH2 or by competing with 
METH1 or METH2 for a substrate. 

All of these above assays can be used as diagnostic or prognostic markers. 
The molecules discovered using these assays can be used to treat disease or to 
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bring about a particular result in a patient (e.g., blood vessel growth) by activating 
or inhibiting the METH 1 or METH2 molecule. Moreover, the assays can discover 
agents which may inhibit or enhance the production of METH 1 or METH2 from 
suitably manipulated cells or tissues. 

Therefore, the invention includes a method of identifying compounds 
which bind to METH1 or METH2 comprising the steps of: (a) incubating a 
candidate binding compound with METH1 or METH2; and (b) determining if 
binding has occurred. Moreover, the invention includes a method of identifying 
agonists/antagonists comprising the steps of: (a) incubating a candidate 
compound with METH1 or METH2, (b) assaying a biological activity , and (b) 
determining if a biological activity of METH 1 or METH2 has been altered. 

Other Activities 

METH1 or METH2 polypeptides or polynucleotides may also increase or 
decrease the differentiation or proliferation of embryonic stem cells, besides, as 
discussed above, hematopoietic lineage. 

METH1 or METH2 polypeptides or polynucleotides may also be used to 
modulate mammalian characteristics, such as body height, weight, hair color, eye 
color, skin, percentage of adipose tissue, pigmentation, size, and shape (e.g., 
cosmetic surgery). Similarly, METH 1 orMETH2 polypeptides or polynucleotides 
may be used to modulate mammalian metabolism affecting catabolism, anabolism, 
processing, utilization, and storage of energy. 

METH1 or METH2 polypeptides or polynucleotides may be used to 
change a mammal's mental state or physical state by influencing biorhythms, 
circadian rhythms, depression (including depressive disorders), tendency for 
violence, tolerance for pain, reproductive capabilities (preferably by Activin or 
Inhibin-like activity), hormonal or endocrine levels, appetite, libido, memory, 
stress, or other cognitive qualities. 

METH1 or METH2 polypeptides or polynucleotides may also be used as 
a food additive or preservative, such as to increase or decrease storage 
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capabilities, fat content, lipid, protein, carbohydrate, vitamins, minerals, cofactors 
or other nutritional components. 

Having generally described the invention, the same will be more readily 
understood by reference to the following examples, which are provided by way of 
illustration and are not intended as limiting. 

Cancer Diagnosis and Prognosis 

It is believed that certain tissues in mammals with cancer express 
significantly diminished levels of the METH1 or METH2 protein and mRNA 
encoding the METH1 or METH2 protein when compared to a corresponding 
"standard" mammal, i.e., a mammal of the same species not having the cancer. 
Further, it is believed that diminished levels of the METH 1 or METH2 protein can 
be detected in certain body fluids (e.g., sera, plasma, urine, and spinal fluid) from 
mammals with cancer when compared to sera from mammals of the same species 
not having the cancer. Thus, the invention provides a diagnostic method useful 
during tumor diagnosis, which involves assaying the expression level of the gene 
encoding the METH 1 protein in mammalian cells or body fluid and comparing the 
gene expression level with a standard METH1 gene expression level, whereby a 
decrease in the gene expression level under the standard is indicative of certain 
tumors. The invention also provides a diagnostic method useful during tumor 
diagnosis, which involves assaying the expression level of the gene encoding the 
METH2 protein in mammalian cells or body fluid and comparing the gene 
expression level with a standard METH2 gene expression level, whereby a 
decrease in the gene expression level under the standard is indicative of certain 
tumors. 

Where a tumor diagnosis has already been made according to conventional 
methods, the present invention is useful as a prognostic indicator, whereby 
patients exhibiting diminished METH 1 or METH2 gene expression will experience 
a worse clinical outcome relative to patients expressing the gene at a lower level 
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By "assaying the expression level of the gene encoding the METH1 or 
METH2 protein" is intended qualitatively or quantitatively measuring or 
estimating the level of the METH1 or METH2 protein or the level of the mRNA 
encoding the METH1 or METH2 protein in a first biological sample either directly 
(e.g., by determining or estimating absolute protein level or mRNA level) or 
relatively (e.g., by comparing to the METH1 or METH2 protein level or mRNA 
level in a second biological sample). 

Preferably, the METH1 or METH2 protein level or mRNA level in the 
first biological sample is measured or estimated and compared to a standard 
METH1 or METH2 protein level or mRNA level, the standard being taken from 
a second biological sample obtained from an individual not having the cancer. As 
will be appreciated in the art, once a standard METH1 or METH2 protein level 
or mRNA level is known, it can be used repeatedly as a standard for comparison. 

By "biological sample" is intended any biological sample obtained from an 
individual, cell line, tissue culture, or other source which contains METH1 or 
METH2 protein or mRNA. Biological samples include mammalian body fluids 
(such as sera, plasma, urine, synovial fluid and spinal fluid) which contain secreted 
mature METH1 or METH2 protein, and adrenal, thyroid, stomach, brain, heart, 
placenta, lung, liver, muscle, kidney, pancreas, testis and ovarian tissue (for 
METH1 ); and prostate, small intestine, colon, brain and lung tissue (for METH2). 

The present invention is useful for detecting cancer in mammals. In 
particular the invention is useful during diagnosis of the of following types of 
cancers in mammals: breast, ovarian, prostate, liver, lung, pancreatic, colon, and 
testicular. Preferred mammals include monkeys, apes, cats, dogs, cows, pigs, 
horses, rabbits and humans. Particularly preferred are humans. 

Total cellular RNA can be isolated from a biological sample using the 
single-step guanidinium-thiocyanate-phenol-chloroform method described in 
ChomczynskiandSacchi,i4/2a/. J fi/oc/?ew. 752.156-159(1987). Levels of mRNA 
encoding the METH1 or METH2 protein are then assayed using any appropriate 
method. These include Northern blot analysis (Harada et al t Cell 63:303-3 12 
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(1990)), SI nuclease mapping (Fujita et al, Cell 49:357- 367 (1987)), the 
polymerase chain reaction (PCR), reverse transcription in combination with the 
polymerase chain reaction (RT-PCR) (Makino et aL, Technique 2:295-301 
(1990)), and reverse transcription in combination with the ligase chain reaction 
(RT-LCR). 

Assaying METH1 or METH2 protein levels in a biological sample can 
occur using antibody-based techniques. For example, METH1 or METH2 protein 
expression in tissues can be studied with classical immunohistological methods 
(Jalkanen, M, etal, J. Cell Biol 707:976-985 (1985); Jalkanen, M, et al, 1 
Cell. Biol 705:3087-3096(1987)). 

Other antibody-based methods useful for detecting METH1 or METH2 
protein gene expression include immunoassays, such as the enzyme linked 
immunosorbent assay (ELISA) and the radioimmunoassay (RIA). 

Suitable labels are known in the art and include enzyme labels, such as, 
glucose oxidase, and radioisotopes, such as iodine ( l25 I, l21 I), carbon ( 14 C), sulfur 
C 5 S), tritium ( 3 H), indium ( ll2 ln), and technetium (""Tc), and fluorescent labels, 
such as fluorescein and rhodamine, and biotin. 

Modes of administration 

It is recognized than an increase in the vascular supply plays a central role 
in tumor progression and metastasis; therefore, inhibitors of angiogenesis can 
prove effective as adjuvant therapy for cancer patients. Some of the currently 
recognized angiogenic suppressors are poor candidates for systemic treatment due 
to severe collateral effect. The present inventors have found that METH1 and 
METH2 are potent inhibitors of angiogenesis both in vitro and in vivo. The 
advantage of METH1 and METH1 is that these inhibitors are normally associated 
with suppression of physiological angiogenesis; therefore, they offer lack of 
toxicity and endothelial specificity over other angiogenic inhibitors. Furthermore, 
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METH1 and METH2 present a restricted pattern of expression providing a 
possible advantage on organ specificity. 

Accordingly, the polypeptides of the present invention may be employed 
to treat cancer. The METH1 and METH2 polypeptides of the present invention 
can also be used to treat individuals with other disorders that are related to 
angiogenesis, including abnormal wound healing, inflammation, rheumatoid 
arthritis, psoriasis, endometrial bleeding disorders, diabetic retinopathy, some 
forms of macula degeneration, hemangiomas, and arterial-venous malformations. 

Thus, the invention provides a method of inhibiting angiogenesis in an 
individual comprising administering to such an individual a pharmaceutical 
composition comprising an effective amount of an isolated METH1 polypeptide 
of the invention, effective to increase the METH1 activity level in such an 
individual. The invention also provides a method of inhibiting angiogenesis in an 
individual comprising administering to such an individual a pharmaceutical 
composition comprising an effective amount of an isolated METH2 polypeptide 
of the invention, effective to increase the METH2 activity level in such an 
individual. 

METH1 polypeptides which may be used to inhibit angiogenesis in this 
manner include: METH1 polypeptide encoded by the deposited cDNA including 
the leader; the mature METH1 polypeptide encoded by the deposited the cDNA 
minus the leader (i.e., the mature protein); a polypeptide comprising amino acids 
about 1 to about 950 in SEQ ID NO:2; a polypeptide comprising amino acids 
about 2 to about 950 in SEQ ID NO:2; a polypeptide comprising amino acids 
about 29 to about 950 in SEQ ID NO:2; a polypeptide comprising amino acids 
about 30 to about 950 in SEQ ID NO:2; a polypeptide comprising the 
metalloprotease domain of METH1, amino acids 235 to 459 in SEQ ID NO:2; a 
polypeptide comprising the disintegrin domain of METH 1 , amino acids 460 to 544 
in SEQ ID NO:2; a polypeptide comprising the first TSP-like domain of METH1, 
amino acids 545 to 598 in SEQ ED NO:2; a polypeptide comprising the second 
TSP-like domain of METH1, amino acids 841 to 894 in SEQ ID NO:2; a 
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polypeptide comprising the third TSP-like domain of METH1, amino acids 895 
to 934 in SEQ ID NO:2; a polypeptide comprising amino acids 536 to 61 3 in SEQ 
ID NO:2; and a polypeptide comprising amino acids 549 to 563 in SEQ ID NO:2. 
METH2 polypeptides which may be used to inhibit angiogenesis in this 
5 manner include: the METH2 polypeptide encoded by the deposited cDNA 

including the leader; the mature METH2 polypeptide encoded by the deposited the 
cDNA minus the leader (i.e., the mature protein); a polypeptide comprising amino 
acids about 1 to about 890 in SEQ ID NO:4; a polypeptide comprising amino ^ 
acids about 2 to about 890 in SEQ ID NO:4; a polypeptide comprising amino 

10 acids about 24 to about 890 in SEQ ID NO:4; a polypeptide comprising amino 

acids about 112 to about 890 in SEQ ID NO:4; a polypeptide comprising the 
metalloprotease domain of METH2, amino acids 214 to 439 in SEQ ID NO:4; a 
polypeptide comprising the disintegrin domain ofMETH2, amino acids 440 to 529 
in SEQ ID NO:4; a polypeptide comprising the first TSP-like domain of METH2, 

15 amino acids 530 to 583 in SEQ ID NO:4; a polypeptide comprising the second 

TSP-like domain of METH2, amino acids 837 to 890 in SEQ ID NO:4; a 
polypeptide comprising amino acids 280 to 606 in SEQ ID NO:4; and a 
polypeptide comprising amino acids 529 to 548 in SEQ ID NO:4. 

As a general proposition, the total pharmaceutical^ effective amount of 

20 METH1 or METH2 polypeptide administered parenterally per dose will be in the 

range of about 1 jig/kg/day to 10 mg/kg/day of patient body weight, although, as 
noted above, this will be subject to therapeutic discretion. More preferably, this 
dose is at least 0.01 mg/kg/day, and most preferably for humans between about 
0.01 and 1 mg/kg/day for the polypeptide. If given continuously, the METH1 or 

25 METH2 polypeptide is typically administered at a dose rate of about 1 jig/kg/hour 

to about 50 fig/kg/hour, either by 1-4 injections per day or by continuous 
subcutaneous infusions, for example, using a mini-pump. An intravenous bag 
solution may also be employed. 

Pharmaceutical compositions containing the METH1 or METH2 of the 

30 invention may be administered orally, rectally, parenterally, intracistemally, 
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intravaginally, intraperitoneal^, topically (as by powders, ointments, drops or 
transdermal patch), bucally, or as an oral or nasal spray. By "pharmaceutically 
acceptable carrier" is meant a non-toxic solid, semisolid or liquid filler, diluent, 
encapsulating material or formulation auxiliary of any type. The term "parenteral" 
as used herein refers to modes of administration which include intravenous, 
intramuscular, intraperitoneal, intrasternal, subcutaneous and intraarticular 
injection and infusion. 

Chromosome Assays 

The nucleic acid molecules of the present invention are also valuable for 
chromosome identification. The sequence is specifically targeted to and can 
hybridize with a particular location on an individual human chromosome. The 
mapping of DNAs to chromosomes according to the present invention is an 
important first step in correlating those sequences with genes associated with 
disease. 

In certain preferred embodiments in this regard, the cDNA herein disclosed 
is used to clone genomic DNA of a METH1 or METH2 protein gene. This can 
be accomplished using a variety of well known techniques and libraries, which 
generally are available commercially. The genomic DNA then is used for in situ 
chromosome mapping using well known techniques for this purpose. 

In addition, in some cases, sequences can be mapped to chromosomes by 
preparing PCR primers (preferably 15-25 bp) from the cDNA. Computer analysis 
of the 3 ' untranslated region of the gene is used to rapidly select primers that do 
not span more than one exon in the genomic DNA, thus complicating the 
amplification process. These primers are then used for PCR screening of somatic 
cell hybrids containing individual human chromosomes. 

Fluorescence in situ hybridization ("FISH") of a cDNA clone to a 
metaphase chromosomal spread can be used to provide a precise chromosomal 
location in one step. This technique can be used with probes from the cDNA as 
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short as 50 or 60 bp. For a review of this technique, see Verma et ai t Human 
Chromosomes: A Manual Of Basic Techniques, Pergamon Press, New York 
(1988). 

Once a sequence has been mapped to a precise chromosomal location, the 
physical position of the sequence on the chromosome can be correlated with 
genetic map data. Such data are found, for example, in V. McKusick, Mendelian 
, Inheritance In Man, available on-line through Johns Hopkins University, Welch 
Medical Library. The relationship between genes and diseases that have been 
mapped to the same chromosomal region are then identified through linkage 
analysis (coinheritance of physically adjacent genes). 

Next, it is necessary to determine the differences in the cDN A or genomic 
sequence between affected and unaffected individuals. If a mutation is observed 
in some or all of the affected individuals but not in any normal individuals, then the 
mutation is likely to be the causative agent of the disease. 

Having generally described the invention, the same will be more readily 
understood by reference to the following examples, which are provided by way of 
illustration and are not intended as limiting. 

Examples 

Example 1; Identification and cloning of METH1 and METH2 

To search for novel genes with TSP-like domains, a large human cDNA 
database consisting of approximately 900,00 expressed sequence tags (ESTs) was 
screened for sequences homologous to the second type I repeat of TSP 1 . Several 
ESTs were predicted to encode proteins with TSP-like domains. Two cDNA 
clones originated from human heart and lung libraries were further sequenced and 
chosen for functional analysis. 
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The amino-terminal end of METH1 was obtained using 5' rapid 
amplification of cDNA ends (RACE) PCR technique (Marathon cDNA 
amplification kit, Clontech) according to manufacturer instructions. The amino- 
terminal end of METH2 was obtained partially through 5'RACE PCR and later 
confirmed and completed by genomic screening. For the genomic screen, BAC 
clones (Genome Systems) were initially identified by PCR. Positive BAC clones 
containing 150-200bp of sequence were subsequently subcloned into pGEM 
vector as small fragments and sequenced. 

Analysis and comparison of the deduced amino acid sequence with the 
GenBank, EMBL and SwissProt databases suggested that these genes belong to 
a new family of metalloproteases with homology to the reprolysin family in their 
NH2-terminal end and with several TSP-like motifs in the COOH-terminal end. 
These cDNAs were named METH1 and METH2; ME, for metalloprotease and 
TH, for thrombospondin. The mouse homologue of METH1 was identified and 
named ADAMTS1 (Kuno, K., etai t J. Biol Chem. 272:556-562(1997)). Direct 
comparison of the human and mouse sequences revealed a high level of 
conservation (83 .4% amino acid identity). Thus far no homologues for METH2 
have been identified. 

Interestingly, a recently identified protein named pNPI (procollagen I N- 
proteinase; (Colidge, A., etaL, Proc. Natl. Acad ScL USA 94:2374-2379 (1997)) 
showed a striking sequence and structural similarity to METH1 and METH2 
(Figure 3). As the novel proteins described here, pNPI also contains 
metalloproteinase (reprolysin subfamily) and TSP domains at the carboxy-terminal 
end. Although the sequence for pNPI is of bovine origin, sequence alignment 
revealed identical structural features. The amino acid similarity between METH1 
and METH2 is 5 1 .7%, and between METH1 or METH2 and pNPI the homology 
is lesser 33.9% and 36.3%, respectively. 

Sequence analysis showed that the ORF of METH1 and METH2 coded 
for proteins of 950 and 890 amino acids, respectively. In all three proteins, the 
NH 2 terminal end contains a putative signal peptide followed by another putative 
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transmembrane domain around amino acid 300, deduced from the hydrophilicity 
plots. It is not clear whether these proteins are bound to the membrane. 
However, given preliminary data, it is more likely that this second transmembrane 
domain will consist of a hydrophobic pocket and that METH1, METH2 and pNPI 
are in fact secreted proteins. The Nonterminal end past the signal peptide has 
homology to the superfamily of zinc metalloproteases and can be subdivided in a 
prodomain, a metalloprotease domain, and a cysteine-rich region. 

The double underlined sequence in METH1 and METH2 in Figure 3 
localized at the boundary between the prodomain and the metalloprotease domain, 
are potential cleavage sites for mammalian subtilisins, such as furins (Barr, 1991). 
Proteolytical processing occurs in SVMPs to yield soluble metalloproteases and 
disintegrins (Bjarnason, J.B. & Fox, J.W., Methods Enzymol 245:345-368 
(1 995)) and has also been detected in some ADAMs (reviewed by Wolsberg, T.G. 
& White, J.M., Developmental Biology 750:389-401 (1996)). At this point, 
preliminary experiments suggest that proteolytical processing occurs, at least in 
METH1. Additionally, both METH1 and METH2 present a Zn 2+ -binding site 
(dotted line in Figure 3) that is presumed to be catalytically active due to the 
conservation of certain functionally important amino acids (Rawlings, N.D. & 
Barrett, A.J., Methods Enzymol 245:183-228 (1995)) suggesting that these 
proteins may be active proteases. Following the metalloprotease domain, there 
is a cysteine-rich region which contains two putative disintegrin loops (Wolsberg, 
T.G. & White, J.M., Developmental Biology 750:389-401 (1996)) (marked by 
arrows in Figure 3). Disintegrin domains are found within the superfamily of 
metalloproteases in snake venom metalloproteases (SVMPs) and ADAMs 
(mammalian proteins containing a disintegrin and a metalloprotease domain) and 
have a possible function inhibiting binding of integrins to their ligands in SVMPs. 
Conversely, the ADAM-disintegrin-like domain, as part of membrane anchored 
proteins, may promote rather than disrupt, cell-cell interactions (Wolsberg, T.G. 
& White, J.M., Developmental Biology 750:389-401 (1996)). The TSP-like 
domains are located in the COOH-half of METH1 and METH2 proteins. METH1 
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contains two conserved TSP domains separated by a spacer region with unknown 
function, and a subdomain with less homology, and only 5 cysteines, following the 
second anti-angiogenic region. METH2 contains two TSP domains separated by 
the spacer region. The alignment of the TSP-like domains of METH1 and 
METH2 with those of TSP1 and TSP2 are shown in Figure 5. The homology 
varies between 19.2% to 52% amino acid similarity among all the TSP repeats. 
The cysteines, numbered 1 to 6, and the tryptophans, labeled by asterisks, are 
highly conserved. 

Southern blot of human genomic DNA revealed the presence of METH1 
and METH2 in the genome. METH1 and METH2 probes revealed bands of 
different size suggesting that they are transcribed from different genes. 

The consensus sequence for the type I repeats includes 16 residues with 
6 perfectly conserved cysteines. Typically it begins with the sequence motif 
WSXWS (SEQ ID NO:82) that has also been shown to bind to heparin (Guo, R, 
et cd. % J. Biol Chem. 267:19349-19355 (1992)). The affinity of this region to 
heparin has been proposed to the part of the anti-angiogenic activity of TSP- 1 
(Guo, N., et al t J. Peptide Res. 49 (1 997)). Among the five members of the TSP 
family of proteins, only TSP- 1 and TSP-2 inhibit angiogenesis and contain the type 
I repeats (Tolsma, S.S., et aL, J. Cell. Biol. 722:497-511 (1993); Kyriakides, 
T.R., et al., J. Cell Biol 740:419-430 (1998)). The type I or properdin repeats 
were probably added to the precursor of TSP 1 and 2 by exon shuffling between 
500 and 900 years ago (Adams, J., et aL, The Thrombospondin Gene Family, 1 
Ed. Molecular Biology Intelligence Unit (Springer, Ed.), R.G. Landes Company, 
Germany (1995)). It is likely that the acquisition of this domain provided the 
precursor of TSP 1 and TSP2 with functions, such as regulation of new vessel 
formation. More recently, BAI-1 (brain angiogenic inhibitor- 1 ), a protein isolated 
from a brain library for its ability to be regulated by p53, has also been shown to 
contain the type I repeat of TSP- 1 and to provide anti-angiogenic potential to this 
molecule (Nishimori, H., etal, Oncogene 75:2145-2150 (1997)). Nevertheless, 
it appears that additional sequences or context are also important, since other 



WO 99/37660 



-117- 



PCIYUS99/01313 



proteins containing the typelrepeats appear not to have clear or more established 
anti-angiogenic properties such as: properdin, F-spondin, and other members of 

the complement family. 

Because of the presence of TSP-repeats in METH1 and METH2, along 

5 with their anti-angiogenic properties, these proteins were originally considered 

membersoftheTSPsuperfamily. Nevertheless, they have no additional homology 
to other TSPs, and in fact, the similarity to TSP1 and TSP2 is restricted to the 
type I repeats. Furthermore, the proteins also have strong sequence and structural 
homology to members of the ADAM family. These features led Kuno and 

10 colleagues to name ADAMTS to the mouse homolog of METH1 (Kuno, K, et 

al, J. Biol. Chem. 272:556-562 (1997)). The recent identification of pNPI and 
its striking sequence homology to the proteins here described, prompt all these 
three proteins to be grouped in a subfamily named metallospondins. At this point, 
it is not clear whether pNIP has anti-angiogenic properties or whether METH1 

15 and/or METH2 participate in the cleavage of the amino terminal pro-peptide of 

a 1(1) procollagen. 

Example 2: Northern and Southern blot analysis 

Total RNA was purified from cells by guanidinium-isothiocyanate 
extraction, as previously described (Chomczynski, P. & Sacchi, N., Anal. 

20 Biochem. 762:156-159 (1 987)) Poly(A)+RNA was extracted using a Boehringer 

Mannheim (BMB, Indianapolis, IN) kit according to the manufacturer conditions. 
Other poly(A>+-RNA blots were purchased from Clontech (Palo Alto, CA). Pre- 
hybridization was performed in a solution containing. 50% formamide, 6X SSPE, 
lXDenhardt's solution, 0.1% SDS and lOOug/ml of heat denatured salmon sperm 

25 DNA for 12-18h at 42°C. Hybridization with labeled cDNA probes proceeded 

inthesamesolutionat42Xforl2-18h. TSP1 and METH1 probes corresponded 
to the entire human cDNAs. METH2 probe corresponded to a Kpnl-EcoFl 
fragment from the human cDNA A 1.3Kb />^I fragment of the glyceraldehyde-3- 
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phosphate-dehydrogenase (GPDH) was used to nonnalize for loading and transfer 
efficiency. Membranes were exposed to Kodak Biomax MS film (Kodak, New 
Haven, CT). 

For Southern blots, human genomic DNA, purchased from Promega 
5 (Madison, WI), was heated at 65°C for 10 min and digested with£coRI and Psil 

overnight at 37°C. 5ug of digested DNA was separated in a 1% agarose gel, 
transferred to a nytran membrane and cross-linked by ultraviolet light. cDNA 
probes, as well as, prehybridization and hybridization conditions were identical to 
those described for Northern blots. Blots were washed with high stringency (0.2X 
10 SSC, 0.2% SDS at 50°C). ' 

The expression pattern of METHl and METH2 was examined in both 
adult and embryonic tissues. Northern blot analysis was performed under high- 
stringency conditions with blots that included poly(A)+RN A from human tissues. 
METHl and METH2 transcripts revealed a single band of 4.6 and 3.7Kb, 
15 respectively. Abundant METHl mRNA expression was observed in adrenal, 

heart, placenta, followed by skeletal muscle, thyroid and stomach. From the 
embryonic tissues analyzed, kidney showed the highest expression of METHl 
mRNA. Nevertheless, weaker expression of METHl mRNA was seen in all 
tissues analyzed. Distribution of METH2 mRNA was more restricted and weaker 
20 than that of METHl . The highest expression was seen in lung, both embryonic 

and adult. Interestingly, METHl and METH2 expression do not appear to 
overlap. In combination, the structural similarities and their pattern of expression 
suggest functional redundancy yet different transcriptional regulation. The 
expression levels of TSP1 transcripts in the same blots were also analyzed, for 
25 purpose of comparison. TSP1 mRNA highest expression was seen in the adult 

placenta and in all embryonic tissues analyzed. In contrast to METHl and 
METH2 we observed constant levels of TSP1 transcript in all the other tissues 
examined. 

The cell type distribution was also studied by Northern blot analysis of 
30 poly(A)+RNA. METHl mRNA was detectable, at low levels, in dermal 
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fibroblasts, vascular smooth muscle, endometrial stromal cells, and in two cancer 
cell lines, HeLa and G631, an adenocarcinoma and a melanoma, respectively. 
METH2 mKNA was detected only on SW480, a colon carcinoma cell line, but no 
expression was seen in any other of the cell lines or primary strains analyzed. 

The possibility that groups of angiogenic and anti-angiogenic factors 
regulate vascular network formation in specific organs has been a frequently 
discussed hypothesis likely to be true, yet unproven. The expression patterns of 
METH 1 and METH2, which are clearly distinct and almost non-overlapping, were 
puzzling, at least with concern to overall levels. TSP1 and TSP2 also share 
identical structure, high level of amino acid similarity, yet their pattern of 
expression differs significantly (Iruela-Arispe, M.L.,Z)ev. Dyn 797:40-56 (1993)). 
The differences are likely based on dissimilar cis-acting elements in their 
promoters and different regulatory mechanisms, as previously suggested. 
Although the promoters for METH1 and 2 have not been characterized, itislikely 
that they provide unique features for the regulation of each gene. Nevertheless, 
the possibility that one motif, the anti-angiogenic / type I repeat, with 
demonstrated anti-angiogenic properties is present in several proteins with 
different tissue specificities is appealing! Alternatively, the small differences in 
sequence between closely related members of the same family could possess 
significance that goes beyond functional redundancy. In the case of TSP1 and 
TSP2, aside from the striking structural similarities and perhaps having 
functionally common anti-angiogenic properties, TSP1 and TSP2 also appear to 
display functions of their own and not likely shared by their similar relative. This 
became evident with the outcome of the two knock-outs for these genes. TSP1 
null animals exhibited primarily lung disorders (Lawler, J., et aL. J. Clin. Invest. 
707:982-992 (1998)) and secondarily vascular abnormalities, but only under 
specific pathological settings or on a restricted set of organs. In contrast TSP2 
knock-out mice exhibited unpredicted collagen assembly anomalies, with carry-on 
consequences to the skin, tendons, and bone (Kyriakides, T.R., et al, J. Cell Biol. 
7^0:419-430 (1998)). In addition, these animals also appear to have overall 
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increase in capiUary density in the dermis. It is not understood how the 
resemblance between the newly described members of the metallospondin family 
translate functionally. Clearly, pND> has been shown to display active proteolytic 
activity by cleavmg the N-terrmnusoftype I procollagen (Colidge, A., e/a^^ 
Natl. Acad. Sci. USA 9*2374-2379 (1997)). 

A second region of functional interest corresponds to the disintegrin 
domain. This domain has been more fully characterized in related members of the 
snake venom metalloproteases that havebeen shown to bind toaUbP3 and inhibit 

platelet interaction blocking coagulation (Pfaff, M., et al., CellAdhes Commun. 
2:491-501 (1994); Usami, Y., etal. Biochem. Biophys. Res. Commun. 207:331- 
339 (1994)). The disintegrin motif consists of a thirteen to fifteen domain which 
frequently contain an RGD or a negatively charged residue at the position of the 
aspartic acid. The RGD, or equivalent, binds to integrins and serve as antagonist 
or signaling ligands (Wolsberg, T.G. & White, J.M., Developmental Biology 
750:389-401 (1996)). METH2, but not METH1, has an RGD sequence located 
amino-terminal to the disintegrin domain. In addition, both molecules present 
relatively high, but not perfect, degree of conservation of cysteines within the 
disintegrin motif. This appears to display an important role in the tertiary 
structure of this region and its ability to interact with integrins. In addition, some 
of these domains have been shown to act as functional adhesion molecules, 
particularly those with transmembrane regions (Wolsberg, T.G. & White, J.M., 
Developmental Biology 7*0:389-401 (1996)). It is unlikely that this will be the 
case for METH1 and METH2, since both these proteins appear to be secreted. 

Example 3: Expression and purification of recombinant proteins 

Recombinant constructs for expression of truncated fusion proteins were 
as follows: (1) pRSET-METHl -Type I. METH1 nt 1605-1839 (from the start 
codon) was amplified by polymerase chain reaction using the following primers: 
5 '-GCA TTT TGG ATC CGC CTT TTC ATG-3' (SEQ ID N0.78) and 5'-GTT 
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GTG TGC TGC AGA TTG TTC C-3 ' (SEQIDNO:79). The amplified fragment 
was then subcloned into the BamHl and Pstl sites of the pRSET vector; (2) 
pGEX-METHl-TSP was generated by ligating theBamlU-EcoRl fragment from 
the pRSET-METH 1 -TSP into the Smal site of the pGEX-5X vector (Pharmacia 
Biotech Inc., Piscataway, NJ) by blunt-end ligation; (3) pGEX-1.0-METH2: the 
fragment nt 838-1818 of METH2 cDNA (from the start codon) was ligated into 
Bamm-EcoRl sites of pGEM-2TK. The METH2 fragment was amplified by PCR 
usingthe Mowing primers: 5 '-GAAAAATGGGGATCCGAGGTG-3 ' (SEQID 
NO.80) and 5 ' -GC AGGAGAATTCCGTCCATG-3 ' (SEQ IDNO:81) to generate 
Bamm and £coRI restriction sites; (4) pGEX-METH2-TSP: a 0.5Kb Xmal- 
EcoRl fragment isolated from pGEX-1.0-METH2 was subcloned into the Xma\ 
and EcoRl sites of pGEX-2TK vector. All constructs were sequenced to verify 
sequence fidelity and correct open reading frame. 

The recombinant proteins were named 6H-METH1, the recombinant 
protein expressed with the plasmid pRSET-METH 1 -TSP, GST-METH1, the 
protein expressed with the plasmid pGEX-METHl-TSP and GST-METH2, the 
protein expressed with the plasmid pGEX-METH2-TSP. 

Expression plasmids were transformed into BL21:DE3 E. coli strain 
(Stratagene Cloning Systems, La Jolla, CA) and fusion proteins were induced 
following manufacturer recommendations. Briefly, induced bacteria pellets were 
resuspended in PBS and sonicated on ice for 1 min. The suspension was, 
subsequentiy, incubated at RT for 20min in the presence of 1% triton X-100 and 
centrifuged at 4°C. ffistidine tagged fusion proteins were then purified on Ni- 
NTA beads (Qiagen, Chatsworth, CA) by incubating 20ml of supernatant with 1ml 
of beads (50% slurry) for 2h at 4°C. The suspension was transferred into a 
column and washed with 1 0 columns volume of PBS containing lOmM imidazole, 
followed by 50mM imidazole and finally lOOmM imidazole. The protein was 
eluted with 500mM imidazole in PBS. Fractions containing the recombinant 
protein were dialyzed against phenol-red free DMEM. Samples were centrifuged 
for 30min at 4°C, part of the protein was not soluble and was lost during 
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centrifugation. The supernatant was stored at -70°C and used for proliferation, 
cornea pocket and chorioallantoic membrane (CAM) assays. 

For purification of GST-fusion proteins, the extract was cleared by 
centrifugation and applied to a GST-affinity column (Pharmacia). The column 
5 was washed with PBS-1% triton X-100 in the presence of O.lmM reduced 

glutathione and, subsequently, with the same buffer in the presence of O.SmM 
reduced glutathione. Fusion proteins were eluted with lOmM reduced glutathione 
in 50mM Tris-HCl, pH 7.5 . Fractions containing the protein were dialyzed against 
DMEM, stored at -70°C and used for proliferation, cornea pocket and 
10 chorioallantoic membrane (CAM) assays. 

Integrity and purity of recombinant proteins was analyzed in 12.5% or 
15% acrylamide gels stained with Coomassie blue. 

A recombinant GST fusion protein containing the first two type I repeats 
of TSP was also dialyzed against DMEM before used in functional assays. Intact 
15 TSP1 was purified from platelets as previously described (Roberts, D.D., et aL, 

J. Tissue Cult. Methods 76:217-222 (1994)). 

To test the hypothesis that METH1 and METH2 TSP domains could 
functionasregulatorsofangiogenesis recombinant wsionproteinswere 

in bacteria. The constructs included the first TSP domain of METH1 or METH2. 

20 This domain is the most conserved, 52% amino acid similarity with the second 

type I repeat of TSP1, (this domain contains a putative binding site for CD36). 
All recombinant proteins were isolated under native conditions to preserve their 
secondary structure asmuchas possible. 6H-METH1 and GST-METH1 contained 
the first TSP-like domain of METH1 fused to a histidine tag or a GST, 

25 respectively. METH1 recombinant protein was made with two different tags 

because of purification and structural advantages. The differences in size are due 
to the size of the tag, 6KDa the histidine and 27KDa the GST. GST-METH2 
contained the first TSP domain of METH2 also fused to a GST. A fragment 
corresponding to the last two type 1 repeats of TSP1, also fused to a GST, and 
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intact TSP1 purified from platelets were used as positive controls. In addition, 
GST alone was included in all experiments as negative control. 

Example* TSP domains in mTHl and mm2 disrupt oncogenesis in vivo 

Cornea pocket assay 

Swiss Webster females and males, were purchased from Charles River 
(Boston, MA) and used between 8-10 weeks-old for implantation of the pellets. 
Cornea pockets were performed as described by Kenyon and colleagues (Kenyon, 
B.M., et al, Invest Ophthalmol. Vis. Sci. 37:1625-1632 (1996)) with few 
modifications. Briefly, a solution of lOug of recombinant bFGF plus 5 mg of 
sucralfate were mixed with 10ul of Hydron (200mg/ml in ethanol; New 
Brunswick, NJ) and the recombinant protein of interest (2pg). The suspension 
was then smeared onto a sterile nylon mesh square (pore size 500pm; Tetko Inc., 
Briarcliff Manor, NY) and allowed to dry for 30min. The fibers of the mesh were 
pulled to produce pellets of 500pm 3 that were stored at -20°C. Uniformly sized 
pellets were selected under a microscope and used for the assays. 

Mice were anesthetized with Avertin. An incision was made in the cornea 
using a Nikon SMZ-U dissecting microscope with the aid of a surgical blade. A 
single pellet was implanted into the pocket. Five days after pellet implantation, 
corneal angiogenesis was evaluated and photographed. 



20 CAM assay 

Chorioallantoic membrane assays were performed on Leghorn chicken 
embryos (SPAFAS, MA) at 12-14 days of embryonic development. Matrigel 
(750pg/ml), VEGF (250ng/mesh) and the protein or peptide to be tested were 
mixed, placed onto nylon meshes (pore size 250pm; Tetko Inc.) and incubated 

25 sequentially at 37°C for 30min and at 4°C for 2h to induce polymerization. A 

positive (matrigel and VEGF) and a negative (VEGF alone) control were also 
prepared for each CAM. Polymerized meshes were placed onto the third outer 
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region, of the CAM and incubated for 24h. To visualize vessels, 400ul of 
fluorescein isothiocyanate dextran (lOmg/ml, SIGMA) was injected in the chick 
blood stream. After 5-10min incubation, the chick was topically fixed with 3.7% 
formaldehyde for 5min. The meshes were then dissected and mounted onto slides. 
Fluorescence intensity was analyzed with a computer-assisted image program 

(NIH Image 1.59). 

Peptides used on these assays were synthesized by Chiron (Raleigh, NC). 
Sequence corresponded to amino acids: P-TSP1, 430-447; P-METH1, 549-563; 

P-METH2, 529-548. 

The evaluation of angiogenic or anti-angiogenic responses relies heavily 
on the sensitivity and specificity of the assays used to assess the response. To 
evaluate the anti-angiogenic activity of these fragments in vivo, two popular and 
well-accepted angiogenesis assays were used: the corneal pocket and the 
chorioallantoic membrane. The visibility, accessibility, and avascularity of the 
5 cornea are highly advantageous and facilitate the visualization of the neovascular 

response and the topical application of the test substances. A known amount of 
angiogenesis factor(s)is implanted, as a pellet, in a pocket made in the comea eye. 
To test an angiogenesis inhibitor, the molecule is implanted with the stimulator in 
the same pellet, and the response is compared to the stimulator alone. 
, 0 i„ these experiments, bFGF was used as the vascularization stimulator. 

Pellets containing the recombinant protein were implanted in mouse corneas and 
theirability to inhibit the bFGF-induced angiogenic response was compared to that 
of controls. When a bFGF pellet containing GST was implanted new capillary 
vessels grew from the cornea limbus, across the cornea and into the pellet within 
25 5 days. In contrast, addition of GST-METH1 or GST-METH2 to the bFGF 

pellets completely abolished blood vessel growth. Table 4 contains a summary of 
theresultsobtainedfrom41 assays performed. Intact TSP1 purified from platelets 
and GST-TSP1 were used as positive controls. All assays were performed at 
identical concentrations, suggesting that METHl and METH2 have similar 
30 potency to that of TSP1 in the inhibition of angiogenesis. In addition, when half 
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ofthe standard concentration was used, a weak, however noticeable response was 
seen, indicating a dose-dependent effect. 



Activity of METH1 and METH2 recombinant proteins in the corneal 

pocket assay 


bFGF Pellets 


Vascularized corneas/Total corneas 


Vehicle 


5/5 


TSP1 


0/5 


GST 


11/11 


GST-TSP1-T1 


1/4 


GST-METH 1 -TSP 


0/8 


GST-METH2-TSP 


0/8 



In the CAM assay, the angiogenic response is analyzed by measuring the 
number of vessels that grow within a matrix polymer containing the angiogenic 
growthfactor. To determine whether recombinant METH1 and METH2 proteins 
inhibited neovascularization in the CAM assay induced by VEGF, a matrigel 
polymer containing VEGF and the recombinant protein were implanted in the 
CAM. Quantitative analysis ofthe experiments, which included three different 
polymers per treatment are shown in Figure 6A. Matrigels polymers containing 
VEGF phis 5 M g of GST-METHl or GST-METH2 caused greater than 80% 
inhibition in blood vessel growth. A similar potency was found using the GST 
recombinant protein derived from the type I repeats of TSP1 Furthermore, the 
anti-angiogenic effect ofthe TSP domains in METH1 and METH2 was dose- 
dependent with a complete inhibition of blood vessel growth when 15 Mg/ml of 
protein was used (Figure 6C and D). GST alone, at identical concentrations, had 
no significant effect on VEGF-stimulated angiogenesis. 
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Synthetic peptides from the second or the third type I repeats of human 
TSP 1 can mimic that anti-angiogenic effects of the intact TSP 1 (Tolsma, S. S., ei 
al. t J. Cell Biol 722:497-511 (1993)). In fact, a 19-residue polypeptide was 
shown to be sufficient to block in vivo neovascularization in the rat cornea and to 
inhibit the bFGF-induced migration of cultured endothelial cells (V ogel, T.,etal., 
1 CelLBiochem. 53:74-84 (1993); Tolsma, S.S.,e*a/., 7. CellBiol 122A97-SU 
(1993)). To test whether the same was true for the METH1 and METH2 TSP 
domains, peptides derived from the same region were synthesized and their anti- 
angiogenic activity was evaluated in the CAM assay. The results are shown in 
Figure 6B. Peptides derived from both the TSP domain of METH1 and METH2 
blocked VEGF-induced angiogenesis similarly to that of TSP1. In contrast, 
scramble peptides had no significant effects. 

Example 5: Proliferation assays 

Human dermal endothelial cells (HDEC) were isolated and grown on 
Vitrogen™ coated petri-dishes in EBM (Clonetics, San Diego, C A) supplemented 
with 15% fetal calf serum, 25ng/ml cAMP, and lug/ml of hydrocortisone-21- 
acetate and were used from passages 3 to 6. Cells were made quiescent by 
incubation of confluent monolayers with phenol red-free EBM containing 0.2% 
BS A for 48h. Human dermal fibroblasts were isolated from neonatal foreskin and 
by enzymatic dissociation. Both fibroblasts and smooth muscle cells were 
maintained in DMEM supplemented with 1 0% fetal calf serum. Human mammary 
epithelial cells (HMEC) were purchased from Clonetics and maintained in the 
recommended media (mammary epithelial growth media, MEGM). 

Quiescent human dermal endothelial cells, between passage 3 and 6, were 
plated on Vhrogen™ coated 24-well plates in EBM supplemented with 0.2% 
BSA, 0. 1% fetal calf serum and 1 ng/ml of bFGF in the presence or absence of the 
recombinant protein and incubated at 5% C0 2 at 37°C for 48h. For vascular 
smooth muscle (VSM) and fibroblast proliferation assays, cells were incubated 
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under the same conditions but using DMEM instead of EBM. Human mammary 
epithelial cells were incubated on their growth media. A pulse of [ 3 H]-Thymidine 
(1 \iC\I\d) was added during the last 4h prior harvesting. Cells were washed and 
fixed in 10% TCA. Incorporation of [ 3 H] -thymidine was determined by 
scintillation counting, as previously described (Iruela-Arispe, M L & Sage, E.H., 
1 Cell Biochem. 52:414 (1993)). 

Statistical analysis were done using In-Stat software (Graph Pad Software) 
for Macintosh. Assuming normal distributions, data were analyzed by one-way 
ANOVA, followed by either T-test Dunnett test for comparisons between groups, 
or student-Newman-Kleus test for multiple comparisons between groups. 

To gain insight into the mechanism by which METH1 and METH2 inhibit 
neovascularization, the direct effect of the purified recombinant fusion proteins on 
endothelial cell proliferation was tested. Serum-starved endothelial cells were 
plated into growth medium containing bFGF and FCS. Recombinant proteins 
(3ng/ml) were added at the same time of plating. 40%(GST-METH1), 45% (6H- 
GST) or 36% (GST-METH2) inhibition was observed, in contrast to a non- 
significant effect when GST alone was added. The recombinant protein from the 
type I repeats of TSP1 had similar inhibitory effects. (Figure 7A). Furthermore, 
suppression of proliferation mediated by METH1 or METH2 were dose- 
dependent, as shown in Figure 7E. The inhibition was observed as early as one 
day after treatment and the inhibitory effect was not toxic and reversible since the 
removal of the recombinant protein and subsequent addition of growth factor 
alone led to the resumption of endothelial cell proliferation. 

The cell specificity of the anti-proliferative effects for METH1 and 
METH2 on the endothelium was evaluated by additional proliferation assays on 
a variety of non-endothelial cells. No significant inhibition of proliferation was 
seen on fibroblasts or smooth muscle cell cultures. In contrast, a non significant, 
but reproducible stimulation of proliferation for these two cell types could be 
observed. This result rules out the presence of any potential nonspecific inhibitor 
of cell growth in the recombinant protein preparations. On mammary epithelial 
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cell, however, METH1 and METH2 inhibited cell proliferation to the same degree 
as to endothelial cells. Interestingly, TSP1 also suppresses mammary epithelial 
cell proliferation both in vitro and in a transgenic model. 

The possibility that METH1 and METH2 might act as disintegrins is 
consistent with their anti-angiogenic properties. Clearly blockade of av03 and p 1 
integrins with antibodies has been shown to inhibit neovascularization both during 
development andin tumors (Brooks, PC, etal, G?//55:683-693 (1996); Brooks, 
P.C., et al % Cell 92:391-400 (1998); Senger, D R., et al, Proc. Nail. Acad Set 
USA 94:13612-13617 (1997)). Integrins are essential for the mediation of both 
proliferative and migratory signals (Schwartz, MA. & Ingber, D.E., Mol Biol 
Cell 5:389-393 (1994)), therefore interference with those signals can be highly 
deleterious to the angiogenic process. The angiogenic functional assays were 
performed with recombinant protein containing only the type I repeats in METH1 
and METH2. 

The mechanism of action of METH1 and METH2 with regards to their 
angio-inhibitoiy activity is not known. To date we have evidence that these 
proteins are secreted and bind to endothelial cells. Further investigations are 
guided towards the identification of receptors and signal transduction mechanisms. 
A likely hypothesis resulting from the lessons learned from TSP1 is that both 
METH1 and METH2 bind to CD36. Recently, this scavenger receptor has been 
implicated in the mediation of signals by which TSP-1 exert its anti-angiogenic 
effects (Dawson, D.W., et al, J. Cell Biol 735:707-717 (1997)). Both the 
CSVTCG (SEQ ID NO:83) (Asch, A.S., et al, Nature 262:1436-1439 (1993); 
Catimel, B., et aL. Biochem. J. 28423 1-236 (1992)) and the GCQXR (SEQ ID 
NO:84) sequences have been proposed as primary binding motifs to CD36 
(Dawson, D.W., et al., J. CellBiol 735:707-717(1997)). METHl andMETH2 
have almost entire conservation in both these regions. A complementary and also 
likely occurrence is binding ofMETHl and METH2 to bFGF. Binding to heparin 
and bFGF has been proposed as part of the anti-angiogenic activity of TSP 1 (Guo, 
N., et al t J. Peptide Res. 49 (1997)). This property appears to be mediated 
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through the WSXWS (SEQ ID NO:82) motif, also conserved in METH1 and 
METH2. Future efforts will focus on the signals implicated in the anti-angiogenic 
properties mediated by these novel proteins and on their potential as proteases of 
the extracellular milieu. 

Example 6: Isolation of the METH1 or METH2 cDNA Clone From the 
Deposited Sample 

Two approaches can be used to isolate METH1 or METH2 from the 
deposited sample. First, the deposited clone is transformed into a suitable host 
(such as XL-1 Blue (Stratagene)) using techniques known to those of skill in the 
art, such as those provided by the vector supplier or in related publications or 
patents. The transformants are plated on 1.5% agar plates (containing the 
appropriate selection agent, e.g., ampicillin) to a density of about 150 
transformants (colonies) per plate. A single colony is then used to generate DNA 
using nucleic acid isolation techniques well known to those skilled in the art. (e.g., 
Sambrook et al., Molecular Cloning: A Laboratory Manual, 2nd Edit., (1989), 
Cold Spring Harbor Laboratory Press.) 

Alternatively, two primers of 1 7-20 nucleotides derived from both ends of 
the SEQ ID NO:l or SEQ ID NO:3 (i.e., within the region of SEQ ID NO:l or 
SEQ ID NO:3 bounded by the 5" NT and the 3' NT of the clone) are synthesized 
and used to amplify the METH1 or METH2 cDNA using the deposited cDNA 
plasmidsas templates. The polymerase chain reaction is carried out under routine 
conditions, for instance, in 25 pi of reaction mixture with 0.5 fig of the above 
cDNA template. A convenient reaction mixture is 1.5-5 mMMgCl^ 0.01% (w/v) 
gelatin, 20 uM each of dATP, dCTP, dGTP, dTTP, 25 pmol of each primer and 
0.25 Unit of Taq polymerase. Thirty five cycles of PCR (denaturation at 94 
degree C for 1 min; annealing at 55 degree C for 1 min; elongation at 72 degree 
C for 1 min) are performed with a Perkin-Elmer Cetus automated thermal cycler. 
The amplified product is analyzed by agarose gel electrophoresis and the DNA 
band with expected molecular weight is excised and purified. The PCR product 
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is verified to be the selected sequence by subcloning and sequencing the DNA 
product. 

Several methods are available for the identification of the 5* or 3 1 non- 
coding portions of the METH1 or METH2 gene which may not be present in the 
deposited clones. These methods include but are not limited to, filter probing, 
clone enrichment using specific probes, and protocols similar or identical to 5' and 
3' "RACE" protocols which are well known in the art. For instance, a method 
similar to 5' RACE is available for generating the missing 5' end of a desired full- 
length transcript. (Fromont-Racine e( al. y Nucleic Acids Res. 27(7): 1683-1684 
(1993).) 

Briefly, a specific RNA oligonucleotide is ligated to the 5' ends of a 
population of RNA presumably containing full-length gene RNA transcripts. A 
primer set containing a primer specific to the ligated RNA oligonucleotide and a 
primer specific to a known sequence of the METH1 or METH2 gene of interest 
is used to PCR amplify the 5* portion of the METH1 or METH2 full-length gene. 
This amplified product may then be sequenced and used to generate the full length 
gene. 

This above method starts with total RNA isolated from the desired source, 
although poly-A+ RNA can be used. The RNA preparation can then be treated 
with phosphatase if necessary to eliminate 5 1 phosphate groups on degraded or 
damaged RNA which may interfere with the later RNA ligase step. The 
phosphatase should then be inactivated and the RNA treated with tobacco acid 
pyrophosphatase in order to remove the cap structure present at the 5' ends of 
messenger RNAs. This reaction leaves a 5* phosphate group at the 5' end of the 
cap cleaved RNA which can then be ligated to an RNA oligonucleotide using T4 
RNA ligase. 

This modified RNA preparation is used as a template for first strand cDN A 
synthesis using a gene specific oligonucleotide. The first strand synthesis reaction 
is used as a template for PCR amplification of the desired 5' end using a primer 
specific to the ligated RNA oligonucleotide and a primer specific to the known 
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sequence of the gene of interest. The resultant product is then sequenced and 
analyzed to confirm that the 5' end sequence belongs to the METH1 or METH2 
gene. 

Example 7; Bacterial Expression ofMETHl or METH2 

A METH1 or METH2 polynucleotide encoding a METH1 or METH2 
polypeptide invention is amplified using PCR oligonucleotide primers 
corresponding to the 5' and 3* ends of the DNA sequence, as outlined in Example 
5, to synthesize insertion fragments. The primers used to amplify the cDNA insert 
should preferably contain restriction sites, such as BamHI and Xbal, at the 5* end 
of the primers in order to clone the amplified product into the expression vector. 
For example, BamHI and Xbal correspond to the restriction enzyme sites on the 
bacterial expression vector pQE-9. (Qiagen, Inc., Chatsworth, CA). This plasmid 
vector encodes antibiotic resistance (Amp 1 ), a bacterial origin of replication (ori), 
an IPTG-regulatable promoter/operator (P/O), a ribosome binding site (RBS), a 
6-histidine tag (6-His), and restriction enzyme cloning sites. The pQE-9 

vector is digested with BamHI and Xbal and the amplified fragment is ligated into 
the pQE-9 vector maintaining the reading frame initiated at the bacterial RBS. 
The ligation mixture is then used to transform the E. coli strain M 1 5/rep4 (Qiagen, 
Inc.) which contains multiple copies of the plasmid pREP4, which expresses the 
lacl repressor and also confers kanamycin resistance (Kan 1 ) Transformants are 
identified by their ability to grow on LB plates and ampicillin/kanamycin resistant 
colonies are selected. Plasmid DNA is isolated and confirmed by restriction 
analysis. 

Clones containing the desired constructs are grown overnight (O/N) in 
liquid culture in LB media supplemented with both Amp (100 ug/ml) and Kan (25 
ug/ml). The O/N culture is used to inoculate a large culture at a ratio of 1 : 1 00 to 
1 :250. The cells are grown to an optical density 600 (O.D , 600 ) of between 0.4 and 
0.6. IPTG (Isopropyl-B-D-thiogalacto pyranoside) is then added to a final 
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concentration of 1 mM. EPTG induces by inactivating the lad repressor, clearing 
the P/O leading to increased gene expression. 

Cells are grown for an extra 3 to 4 hours. Cells are then harvested by 
centrifugation (20 mins at 6000Xg). The cell pellet is solubilized in the chaotropic 
5 agent 6 Molar Guanidine HC1 by stirring for 3-4 hours at 4 degree C. The cell 

debris is removed by centrifugation, and the supernatant containing the 
polypeptide is loaded onto a nickel-nitrilo-tri-acetic acid ("Ni-NTA") affinity resin 
column (available from QIAGEN, Inc., supra). Proteins with a 6 x His tag bind 
to the Ni-NTA resin with high affinity and can be purified in a simple one-step 

10 procedure (for details see: The QIAexpressionist (1995) QIAGEN, Inc., supra). 

Briefly, the supernatant is loaded onto the column in 6 M guanidine-HCl, 
pH 8, the column is first washed with 10 volumes of 6 M guanidine-HCl, pH 8, 
then washed with 10 volumes of 6 M guanidine-HCl pH 6, and finally the 
polypeptide is eluted with 6 M guanidine-HCl, pH 5. 

15 The purified METH1 or METH2 protein is then renatured by dialyzing it 

against phosphate-buffered saline (PBS) or 50 mM Na-acetate, pH 6 buffer plus 
200 mM NaCl. Alternatively, the METH1 or METH2 protein can be successfully 
refolded while immobilized on the Ni-NTA column. The recommended conditions 
are as follows: renature using a linear 6M-1M urea gradient in 500 mM NaCl, 

20 20% glycerol, 20 mM Tris/HCl pH 7.4, containing protease inhibitors. The 

renaturation should be performed over a period of 1.5 hours or more. After 
renaturation the proteins are eluted by the addition of 250 mM immidazole. 
Immidazole is removed by a final dialyzing step against PBS or 50 mM sodium 
acetate pH 6 buffer plus 200 mM NaCl. The purified METH1 or METH2 protein 

25 is stored at 4°C or frozen at -80°C. 

In addition to the above expression vector, the present invention further 
includes an expression vector comprising phage operator and promoter elements 
operatively linked to aMETHl or METH2 polynucleotide, called pHE4a. (ATCC 
Accession Number 209645, deposited February 25, 1998.) This vector contains: 

30 1) a neomycinphosphotransferase gene as a selection marker, 2) an£. coli origin 
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of replication, 3) a T5 phage promoter sequence, 4) two lac operator sequences, 
5) a Shine-Delgarao sequence, and 6) the lactose operon repressor gene (laclq). 
The origin of replication (oriC) is derived from pUC19 (LTI, Gaithersburg, MD). 
The promoter sequence and operator sequences are made synthetically. 

DNA can be inserted into the pHEa by restricting the vector with Ndel and 
Xbal, BamHI, Xhol, or Asp718, running the restricted product on a gel, and 
isolating the larger fragment (the stuffer fragment should be about 3 1 0 base pairs) . 
The DNA insert is generated according to the PCR protocol described in Example 
5, using PCR primers having restriction sites for Ndel (5' primer) and Xbal, 
BamHI, Xhol, or Asp718 (3'primer). The PCR insert is gel purified and restricted 
with compatible enzymes. The insert and vector are ligated according to standard 
protocols. 

The engineered vector could easily be substituted in the above protocol to 
express protein in a bacterial system. 

Example 8: Purification ofMETHl orMETH2 Polypeptide from an Inclusion 
Body 

The following alternative method can be used to purify METH1 or 
METH2 polypeptide expressed in E coli when it is present in the form of inclusion 
bodies. Unless otherwise specified, all of the following steps are conducted at 
4-10°C. 

Upon completion of the production phase of the K coli fermentation, the 
cell culture is cooled to 4-10 degree C and the cells harvested by continuous 
centrifugation at 15,000 rpm (Heraeus Sepatech). On the basis of the expected 
yield of protein per unit weight of cell paste and the amount of purified protein 
required, an appropriate amount of cell paste, by weight, is suspended in a buffer 
solution containing 100 mM Tris, 50 mM EDTA, pH 7.4. The cells are dispersed 
to a homogeneous suspension using a high shear mixer. 

The cells are then lysed by passing the solution through a microfluidizer 
(Microfiiidics, Corp. or APV Gaulin, Inc.) twice at 4000-6000 psi. The 
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homogenate is then mixed with NaCl solution to a final concentration of 0.5 M 
NaCl, followed by centrifugation at 7000 xg for 15 min. The resultant pellet is 
washed again using 0.5M NaCl, 100 mM Tris, 50 mM EDTA, pH 7.4. 

The resulting washed inclusion bodies are solubilized with 1 . 5 M guanidine 
hydrochloride (GuHCl) for 2-4 hours. After 7000 xg centrifugation for 15 min., 
the pellet is discarded and the polypeptide containing supernatant is incubated at 
4 degree C overnight to allow further GuHCl extraction. 

Following high speed centrifugation (30,000 xg) to remove insoluble 
particles, the GuHCl solubilized protein is refolded by quickly mixing the GuHCl 
extract with 20 volumes of buffer containing 50 mM sodium, pH 4.5, 150 mM 
NaCl, 2 mM EDTA by vigorous stirring. The refolded diluted protein solution is 
kept at 4 degree C without mixing for 12 hours prior to further purification steps. 

To clarify the refolded polypeptide solution, a previously prepared 
tangential filtration unit equipped with 0. 1 6 urn membrane filter with appropriate 
surface area (e.g., Filtron), equilibrated with 40 mM sodium acetate, pH 6.0 is 
employed. The filtered sample is loaded onto a cation exchange resin (e.g., Poros 
HS-50, Perseptive Biosystems). The column is washed with 40 mM sodium 
acetate, pH 6.0 and ehited with 250 mM, 500 mM, 1 000 mM, and 1 500 mM NaCl 
in the same buffer, in a stepwise manner. The absorbance at 280 nm of the 
effluent is continuously monitored. Fractions are collected and further analyzed 
by SDS-PAGE. 

Fractions containing the METH1 or METH2 polypeptide are then pooled 
and mixed with 4 volumes of water. The diluted sample is then loaded onto a 
previously prepared set of tandem columns of strong anion (Poros HQ-50, 
Perseptive Biosystems) and weak anion (Poros CM-20, Perseptive Biosystems) 
exchange resins. The columns are equilibrated with 40 mM sodium acetate, pH 
6.0. Both columns are washed with 40 mM sodium acetate, pH 6.0, 200 mM 
NaCl. The CM-20 column is then eluted using a 1 0 column volume linear gradient 
ranging from 0.2 M NaCl, 50 mM sodium acetate, pH 6.0 to 1 .0 M NaCl, 50 mM 
sodium acetate, pH 6.5. Fractions are collected under constant A^ monitoring 
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of the effluent. Fractions containing the polypeptide (determined, for instance, by 
16% SDS-PAGE) are then pooled. 

The resultant METH1 or METH2 polypeptide should exhibit greater than 
95% purity after the above refolding and purification steps. No major contaminant 
bands should be observed from Coomassie blue stained 1 6% SDS-PAGE gel when 
5 ug of purified protein is loaded. The purified METH1 or METH2 protein can 
also be tested for endotoxin/LPS contamination, and typically the LPS content is 
less than 0. 1 ng/ml according to LAL assays. 

Example 9: Cloning and Expression of METH1 orMETH2 in a Baculovirus 
Expression System 

In this example, the plasmid shuttle vector pA2 is used to insert METH1 
or METH2 polynucleotide into a baculovirus to express METH1 or METH2. 
This expression vector contains the strong polyhedrin promoter of the A utographa 
californica nuclear polyhedrosis virus (AcMNPV) followed by convenient 
restriction sites such as BamHI, Xba I and Asp71 8. The polyadenylation site of 
the simian virus 40 ("SV40") is used for efficient polyadenylation. For easy 
selection of recombinant virus, the plasmid contains the beta-galactosidase gene 
from E. coli under control of a weak Drosophila promoter in the same orientation, 
followed by the polyadenylation signal of the polyhedrin gene. The inserted genes 
are flanked on both sides by viral sequences for cell-mediated homologous 
recombination with wild-type viral DN A to generate a viable virus that express the 
cloned METH1 or METH2 polynucleotide. 

Many other baculovirus vectors can be used in place of the vector above, 
such as pAc373, pVL941, and pAcIMl, as one skilled in the art would readily 
appreciate, as long as the construct provides appropriately located signals for 
transcription, translation, secretion and the like, including a signal peptide and an 
in-frame AUG as required. Such vectors are described, for instance, in Luckow 
et al., Virology 170:31-39 (1989). 
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Specifically, the METH1 or METH2 cDNA sequence contained in the 
deposited clone, including the AUG initiation codon and any naturally associated 
leader sequence, is amplified using the PCR protocol described in Example 5. If 
the naturally occurring signal sequence is used to produce the secreted protein, the 
pA2 vector does not need a second signal peptide. Alternatively, the vector can 
be modified (pA2 GP) to include a baculovirus leader sequence, using the 
standard methods described in Summers et al., "A Manual of Methods for 
Baculovirus Vectors and Insect Cell Culture Procedures," Texas Agricultural 
Experimental Station Bulletin No. 1555 (1987). 

The amplified fragment is isolated from a 1% agarose gel using a 
commercially available kit ("Geneclean," BIO 101 Inc., La Jolla, Ca.). The 
fragment then is digested with appropriate restriction enzymes and again purified 
on a 1% agarose gel. 

The plasmid is digested with the corresponding restriction enzymes and 
optionally, can be dephosphorylated using calf intestinal phosphatase, using 
routine procedures known in the art. The DNA is then isolated from a 1% 
agarose gel using a commercially available kit ("Geneclean" BIO 101 Inc., La 
Jolla, Ca.). 

The fragment and the dephosphorylated plasmid are ligated together with 
T4 DNA ligase. R coli HB 1 0 1 or other suitable E. coli hosts such as XL- 1 Blue 
(Stratagene Cloning Systems, La Jolla, C A) cells are transformed with the ligation 
mixture and spread on culture plates. Bacteria containing the plasmid are 
identified by digesting DNA from individual colonies and analyzing the digestion 
product by gel electrophoresis. The sequence of the cloned fragment is confirmed 
by DNA sequencing. 

Five ug of a plasmid containing the polynucleotide is co-transfected with 
1.0 ug of a commercially available linearized baculovirus DNA ("BaculoGold* 
baculovirus DNA W , Pharmingen, San Diego, CA), using the lipofection method 
described by Feigner et al., Proc. Natl. Acad. Sci. USA 84:7413-7417 (1987). 
One ug of BaculoGold" virus DNA and 5 ug of the plasmid are mixed in a sterile 
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well of a microtiter plate containing 50 ul of serum-free Grace's medium (Life 
Technologies Inc., Gaithersburg, MD). Afterwards, 10 ul Lipofectin plus 90 ul 
Grace's medium are added, mixed and incubated for 15 minutes at room 
temperature. Then the transfection mixture is added drop-wise to Sf9 insect cells 
5 (ATCC CRL 171 1) seeded in a 35 mm tissue culture plate with 1 ml Grace's 

medium without serum. The plate is then incubated for 5 hours at 27 degrees C. 
The transfection solution is then removed from the plate and 1 ml of Grace's 
insect medium supplemented with 10% fetal calf serum is added. Cultivation is 
then continued at 27 degrees C for four days. 

10 After four days the supernatant is collected and a plaque assay is 

performed, as described by Summers and Smith, supra. An agarose gel with 
"Blue Gal" (Life Technologies Inc., Gaithersburg) is used to allow easy 
identification and isolation of gal-expressing clones, which produce blue-stained 
plaques. (A detailed description of a "plaque assay" of this type can also be found 

15 in the user's guide for insect cell culture and baculovirology distributed by Life 

Technologies Inc., Gaithersburg, page 9-10.) After appropriate incubation, blue 
stained plaques are picked with the tip of a micropipettor (e.g., Eppendorf). The 
agar containing the recombinant viruses is then resuspended in a microcentrifuge 
tube containing 200 ul of Grace's medium and the suspension containing the 

20 recombinant baculovirus is used to infect Sf9 cells seeded in 3 5 mm dishes. Four 

days later the superaatants of these culture dishes are harvested and then they are 
stored at 4 degree C. 

To verify the expression of the polypeptide, Sf9 cells are grown in Grace's 
medium supplemented with 1 0% heat-inactivated FB S . The cells are infected with 

25 the recombinant baculovirus containing the polynucleotide at a multiplicity of 

infection ("MOI") of about 2. If radiolabeled proteins are desired, 6 hours later 
the medium is removed and is replaced with SF900 II medium minus methionine 
and cysteine (available from Life Technologies Inc., Rockville, MD). After 42 
hours, 5 uCi of 35 S-methionine and 5 uCi 35 S-cysteine (available from Amersham) 

30 are added. The cells are further incubated for 16 hours and then are harvested by 
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centrifugation. The proteins in the supernatant as well as the intracellular proteins 
are analyzed by SDS-PAGE followed by autoradiography (if radiolabeled). 

Microsequencing of the amino acid sequence of the amino terminus of 
purified protein may be used to determine the amino terminal sequence of the 
produced METH1 or METH2 protein. 

Example 10: Expression of METH1 orMETH2 in Mammalian Cells 

METH1 or METH2 polypeptide can be expressed in a mammalian cell. A 
typical mammalian expression vector contains a promoter element, which mediates 
the initiation of transcription of mRNA, a protein coding sequence, and signals 
required for the termination of transcription and polyadenylation of the transcript. 
Additional elements include enhancers, Kozak sequences and intervening 
sequences flanked by donor and acceptor sites for RNA splicing. Highly efficient 
transcription is achieved with the early and late promoters from SV40, the long 
terminal repeats (LTRs) from Retroviruses, e.g., RSV, HTLVI, HTVI and the 
early promoter of the cytomegalovirus (CMV). However, cellular elements can 
also be used (e.g., the human actin promoter). 

Suitable expression vectors for use in practicing the present invention 
include, for example, vectors such as pSVL and pMSG (Pharmacia, Uppsala, 
Sweden), pRSVcat (ATCC 37152), pSV2DHFR (ATCC 37146), pBC12MI 
(ATCC 67109), pCMVSport 2.0, and pCMVSport 3.0. Mammalian host cells 
that could be used include, human Hela, 293, H9 and Jurkat cells, mouse NIH3T3 
and CI 27 cells, Cos 1, Cos 7 and CV1, quail QC1-3 cells, mouse L cells and 
Chinese hamster ovary (CHO) cells. 

Alternatively, METH1 or METH2 polypeptide can be expressed in stable 
cell lines containing the METH1 or METH2 polynucleotide integrated into a 
chromosome. The co-transfection with a selectable marker such as DHFR, gpt, 
neomycin, hygromycin allows the identification and isolation of the transfected 
cells. 
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The transfected METH 1 or METH2 gene can also be amplified to express 
large amounts of the encoded protein. The DHFR (dihydrofolate reductase) 
marker is useful in developing cell lines that carry several hundred or even several 
thousand copies of the gene of interest. (See, e.g., Alt, F. W., et al y J. Biol. 
5 Chem. 253:1357-1370 (1978); Hamlin, J. L. and Ma, C, Biochem. et Biophys. 

Acta 7097:107-143 (1990); Page, M. J. and Sydenham, M. A., Biotechnology 
9:64-68 (1991).) Another useful selection marker is the enzyme glutamine 
synthase(GS) (MwrphyetaL y BiochemJ. 227:277-279 (1991);Bebbingtone/a/., 
Bio/Technology 70:169-175 (1992). Using these markers, the mammalian cells 

10 are grown in selective medium and the cells with the highest resistance are 

selected. These cell lines contain the amplified gene(s) integrated into a 
chromosome. Chinese hamster ovary (CHO) and NSO cells are often used for the 
production of proteins. 

Derivatives of the plasmid pSV2-DHFR (ATCC Accession No. 37146), 

15 the expression vectors pC4 (ATCC Accession No. 209646) and pC6 (ATCC 

Accession No.209647) contain the strong promoter (LTR) of the Rous Sarcoma 
Virus (Cullen et al y Molecular and Cellular Biology, 438-447 (March, 1985)) 
phis a fragment of the CMV-enhancer (Boshart et al y Cell 47:521-530 (1985).) 
Multiple cloning sites, e.g., with the restriction enzyme cleavage sites BamHI, 

20 Xbal and Asp71 8, facilitate the cloning of METH 1 or METH2. The vectors also 

contain the 3' intron, the polyadenylation and termination signal of the rat 
preproinsulin gene, and the mouse DHFR gene under control of the S V40 early 
promoter. 

If a naturally occurring signal sequence is used to produce a secreted 
25 protein, the vector does not need a second signal peptide. Alternatively, if a 

naturally occurring signal sequence is not used, the vector can be modified to 
include a heterologous signal sequence in an effort to secrete the protein from the 
cell. (See, e.g., WO 96/34891.) 

The amplified fragment is then digested with the appropriate restriction 
30 enzyme and purified on a 1% agarose gel using a commercially available kit 
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("Geneclean," BIO 101 Inc., La Jolla, Ca.). The isolated fragment and the 
dephosphorylated vector are then ligated with T4 DN A ligase. E. coli HB 1 0 1 or 
XL-1 Blue cells are then transformed and bacteria are identified that contain the 
fragment inserted into plasmid pC6 or pC4 using, for instance, restriction enzyme 
analysis. 

Chinese hamster ovary cells lacking an active DHFR gene is used for 
transfection. Five ng of the expression plasmid pC6 or pC4 is cotransfected with 
0.5 ug of the plasmid pSVneo using lipofectin (Feigner et al., supra). The plasmid 
pS V2-neo contains a dominant selectable marker, the neo gene from Tn5 encoding 
an enzyme that confers resistance to a group of antibiotics including G41 8. The 
cells are seeded in alpha minus MEM supplemented with 1 mg/ml G41 8. After 2 
days, the cells are trypsinized and seeded in hybridoma cloning plates (Greiner, 
Germany) in alpha minus MEM supplemented with 10, 25, or 50 ng/ml of 
metothrexate plus 1 mg/ml G418. After about 10-14 days single clones are 
trypsinized and then seeded in 6-well petri dishes or 10 ml flasks using different 
concentrations of methotrexate (50 nM, 100 nM, 200 nM, 400 nM, 800 nM). 
Clones growing at the highest concentrations of methotrexate are then transferred 
to new 6-well plates containing even higher concentrations of methotrexate ( 1 uM, 
2 uM, 5 uM, 10 mM, 20 mM). The same procedure is repeated until clones are 
obtained which grow at a concentration of 1 00 - 200 uM. Expression of METH1 
or METH2 is analyzed, for instance, by SDS-PAGE and Western blot or by 
reversed phase HPLC analysis. 

Example 11: Construction of N-Terminal and/or C-Terminal Deletion 
Mutants 

The following general approach may be used to clone a N-terminal or C- 
terminal deletion METH1 or METH2 deletion mutant. Generally, two 
oligonucleotide primers of about 15-25 nucleotides are derived from the desired 
5' and 3' positions of a polynucleotide of SEQ ID NO:l or SEQ ID NO:3. The 
5' and 3' positions of the primers are determined based on the desired METH1 or 
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METH2 polynucleotide fragment. An initiation and stop codon are added to the 
5' and 3* primers respectively, if necessary, to express the METH1 or METH2 
polypeptide fragment encoded by the polynucleotide fragment. Preferred METH 1 
or METH2 polynucleotide fragments are those encoding the N-terminal and C- 
terminal deletion mutants disclosed above in the "Polynucleotide and Polypeptide 
Fragments" section of the Specification. 

Additional nucleotides containing restriction sites to facilitate cloning of 
the METH1 or METH2 polynucleotide fragment in a desired vector may also be 
added to the 5* and 3' primer sequences. The METH1 or METH2 polynucleotide 
fragment is amplified from genomic DNA or from the deposited cDNA clone 
using the appropriate PCR oligonucleotide primers and conditions discussed 
herein or known in the art. The METH1 or METH2 polypeptide fragments 
encoded by the METH1 or METH2 polynucleotide fragments of the present 
invention may be expressed and purified in the same general manner as the full 
length polypeptides, although routine modifications may be necessary due to the 
differences in chemical and physical properties between a particular fragment and 
full length polypeptide. 

As a means of exemplifying but not limiting the present invention, the 
polynucleotide encoding the METH1 polypeptide fragment D-40 to S-950 or the 
METH2 polypeptide fragment L-20 to L-890 is amplified and cloned as follows: 
A 5' primer is generated comprising a restriction enzyme site followed by an 
initiation codon in frame with the polynucleotide sequence encoding the N- 
terminal portion of the polypeptide fragment beginning with D-40 or L-20, 
respectively. A complementary 3* primer is generated comprising a restriction 
enzyme site followed by a stop codon in frame with the polynucleotide sequence 
encoding C-terminal portion of the METH1 or METH2 polypeptide fragment 
ending with S-950 or L-890, respectively. 

The amplified polynucleotide fragment and the expression vector are 
digested with restriction enzymes which recognize the sites in the primers. The 
digested polynucleotides are then ligated together. The METH1 or METH2 
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polynucleotide fragment is inserted into the restricted expression vector, 
preferably in a manner which places the METH 1 or METH2 polypeptide fragment 
coding region downstream from the promoter. The ligation mixture is 
transformed into competent E. coli cells using standard procedures and as 
described in the Examples herein. Plasmid DN A is isolated from resistant colonies 
and the identity of the cloned DNA confirmed by restriction analysis, PCR and 
DNA sequencing. 

Example 12: Protein Fusions ofMETHl or METH2 

METH1 or METH2 polypeptides are preferably fused to other proteins. 
These fusion proteins can be used for a variety of applications. For example, 
fusion ofMETHl or METH2 polypeptides to His-tag, HA-tag, protein A, IgG 
domains, and maltose binding protein facilitates purification. (See Example 7; see 
also EP A 394,827; Traunecker, et al., Nature 337:84-86 (1988).) Similarly, 
fusion to IgG-1, IgG-3, and albumin increases the halflife time in vivo. Nuclear 
localization signals fused to METH1 or METH2 polypeptides can target the 
protein to a specific subcellular localization, while covalent heterodimer or 
homodimers can increase or decrease the activity of a fusion protein. Fusion 
proteins can also create chimeric molecules having more than one function. 
Finally, fusion proteins can increase solubility and/or stability of the fused protein 
compared to the non-fused protein. All of the types of fusion proteins described 
above can be made by modifying the following protocol, which outlines the fusion 
of a polypeptide to an IgG molecule, or the protocol described in Example 7. 

Briefly, the human Fc portion of the IgG molecule can be PCR amplified, 
using primers that span the 5 1 and 3' ends of the sequence described below. These 
primers also should have convenient restriction enzyme sites that will facilitate 
cloning into an expression vector, preferably a mammalian expression vector. 

For example, if pC4 (Accession No. 209646) is used, the human Fc 
portion can be ligated into the BamHI cloning site. Note that the 3* BamHI site 
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should be destroyed. Next, the vector containing the human Fc portion is re- 
restricted with BamHI, linearizing the vector, and METH1 or METH2 
polynucleotide, isolated by the PCR protocol described in Example 5, is ligated 
into this BamHI site. Note that the polynucleotide is cloned without a stop cod on, 
otherwise a fusion protein will not be produced. 

If the naturally occurring signal sequence is used to produce the secreted 
protein, pC4 does not need a second signal peptide. Alternatively, if the naturally 
occurring signal sequence is not used, the vector can be modified to include a 
heterologous signal sequence. (See, e.g., WO 96/34891.) 

Human IgG Fc region: 

GGGATCCGGAGCCCAAATCTTCTGACAAAACTCACACATGCCCACC 
GTGCCCAGCACCTGAATTCGAGGGTGCACCGTCAGTCTTCCTCTTCC 
CCCCAAAACCCAAGGACACCCTCATGATCTCCCGGACTCCTGAGGT 
CACATGCGTGGTGGTGGACGTAAGCCACGAAGACCCTGAGGTCAAG 
TTCAACTGGTACGTGGACGGCGTGGAGGTGCATAATGCCAAGACAA 
AGCCGCGGGAGGAGCAGTACAACAGCACGTACCGTGTGGTCAGCGT 
CCTCACCGTCCTGCACCAGGACTGGCTGAATGGCAAGGAGTACAAG 
TGCAAGGTCTCCAACAAAGCCCTCCCAACCCCCATCGAGAAAACCA 
TCTCCAAAGCCAAAGGGCAGCCCCGAGAACCACAGGTGTACACCCT 
GCCCCCATCCCGGGATGAGCTGACCAAGAACCAGGTCAGCCTGACC 
TGCCTGGTCAAAGGCTTCTATCCAAGCGACATCGCCGTGGAGTGGG 
AGAGCAATGGGCAGCCGGAGAACAACTACAAGACCACGCCTCCCGT 
GCTGGACTCCGACGGCTCCTTCTTCCTCTACAGCAAGCTCACCGTGG 
ACAAGAGCAGGTGGCAGCAGGGGAACGTCTTCTCATGCTCCGTGAT 
GCATGAGGCTCTGCACAACCACTACACGCAGAAGAGCCTCTCCCTG 
TCTCCGGGTAAATGAGTGCGACGGCCGCGACTCTAGAGGAT (SEQ ID 
NO:85) 
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Example 13: Production of an Antibody 

The antibodies of the present invention can be prepared by a variety of 
methods. (See, Current Protocols, Chapter 2.) For example, cells expressing 
METH1 or METH2 is administered to an animal to induce the production of sera 
containing polyclonal antibodies. In a preferred method, a preparation of METH 1 
or METH2 protein is prepared and purified to render it substantially free of natural 
contaminants. Such a preparation is then introduced into an animal in order to 
produce polyclonal antisera of greater specific activity. 

In the most preferred method, the antibodies of the present invention are 
monoclonal antibodies (or protein binding fragments thereof). Such monoclonal 
antibodies can be prepared using hybridoma technology. (Kohler et al y Nature 
255:495(1975); Kohler et al y Eur. J. Immunol 6:511 (1976); Kohler et al, Eur. 
J. Immunol 6:292 (1976); Hammerling et al, in: Monoclonal Antibodies and T- 
Cell Hybridomas, Elsevier, N.Y., pp. 563-681 (1981).) In general, such 
procedures involve immunizing an animal (preferably a mouse) with METH1 or 
METH2 polypeptide or, more preferably, with a secreted METH1 or METH2 
polypeptide-expressing cell. Such cells may be cultured in any suitable tissue 
culture medium; however, it is preferable to culture cells in Earle's modified 
Eaglet medium supplemented with 10% fetal bovine serum (inactivated at about 
56 degree C), and supplemented with about 10 g/1 of nonessential amino acids, 
about 1,000 U/ml of penicillin, and about 100 ug/ml of streptomycin. 

The splenocytes of such mice are extracted and fused with a suitable 
myeloma cell line. Any suitable myeloma cell line may be employed in accordance 
with the present invention; however, it is preferable to employ the parent myeloma 
cell line (SP20), available from the ATCC. After fusion, the resulting hybridoma 
cells are selectively maintained in HAT medium, and then cloned by limiting 
dilution as described by Wands et al. (Gastroenterology 50:225-232 (1981).) The 
hybridoma cells obtained through such a selection are then assayed to identify 
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clones which secrete antibodies capable of binding the METH1 or METH2 
polypeptide. 

Alternatively, additional antibodies capable of binding to METH1 or 
METH2 polypeptide can be produced in a two-step procedure using anti-idiotypic 
antibodies. Such a method makes use of the fact that antibodies are themselves 
antigens, and therefore, it is possible to obtain an antibody which binds to a second 
antibody. In accordance with this method, protein specific antibodies are used to 
immunize an animal, preferably a mouse. The splenocytes of such an animal are 
then used to produce hybridoma cells, and the hybridoma cells are screened to 
identify clones which produce an antibody whose ability to bind to the METH1 or 
METH2 protein-specific antibody can be blocked by METH1 or METH2. Such 
antibodies comprise anti-idiotypic antibodies to the METH1 or METH2 protein- 
specific antibody and can be used to immunize an animal to induce formation of 
further METH1 or METH2 protein-specific antibodies. 

It will be appreciated that Fab and F(ab')2 and other fragments of the 
antibodies of the present invention may be used according to the methods 
disclosed herein. Such fragments are typically produced by proteolytic cleavage, 
using enzymes such as papain (to produce Fab fragments) or pepsin (to produce 
F(ab')2 fragments). Alternatively, secreted METH1 or METH2 protein-binding 
fragments can be produced through the application of recombinant DNA 
technology or through synthetic chemistry. 

For in vivo use of antibodies in humans, it may be preferable to use 
"humanized" chimeric monoclonal antibodies. Such antibodies can be produced 
using genetic constructs derived from hybridoma cells producing the monoclonal 
antibodies described above. Methods for producing chimeric antibodies are 
known in the art . (See, for review, Morrison, Science 229: 1 202 ( 1 985); Oi et al , 
BioTechniques42\4(\9%6)\ Cabilly etai y U.S. Patent No. 4,816,567; Taniguchi 
etal. y EP 171496; Morrison etaL y EP 173494; Neuberger et al y WO 8601533; 
Robinson et al. y WO 8702671; Boulianne et al 7 Nature 312:643 (1984); 
Neuberger et al y Nature 314:26% (1985).) 
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Example 14: Production Of METH1 or METH2 Protein For High- 
Throughput Screening Assays 

The following protocol produces a supernatant containing METH1 or 
METH2 polypeptide to be tested. This supernatant can then be used in the 
Screening Assays described in Examples 16-23. 

First, dilute Poly-D-Lysine (644 587 Boehringer-Mannheim) stock solution 
(Img/mlinPBS) l:20inPBS(w/o calcium or magnesium 17-516FBiowhittaker) 
for a working solution of 50ug/mL Add 200 ul of this solution to each well (24 
well plates) and incubate at RT for 20 minutes. Be sure to distribute the solution 
over each well (note: a 1 2-channel pipetter may be used with tips on every other 
channel). Aspirate off the Poly-D-Lysine solution and rinse with 1ml PBS 
(Phosphate Buffered Saline). The PBS should remain in the well until just prior 
to plating the cells and plates may be poly-lysine coated in advance for up to two 
weeks. 

Plate 293T cells (do not carry cells past P+20) at 2 x 1 0 5 cells/well in 5ml 
DMEM(Dulbecco's Modified Eagle Medium)(with 4.5 G/L glucose and L- 
glutamine (12-604F Biowhittaker))/10% heat inactivated FBS(14-503F 
Biowhittaker)/lx Penstrep(l 7-602E Biowhittaker). Let the cells grow overnight. 

The next day, mix together in a sterile solution basin: 300 ul Lipofectamine 
(18324-012 Gibco/BRL) and 5ml Optimem I (31985070 Gibco/BRL)/96-well 
plate. With a small volume multi-channel pipetter, aliquot approximately 2ug of 
an expression vector containing a polynucleotide insert, produced by the methods 
described in Examples 10-12, into an appropriately labeled 96-well round bottom 
plate. With a multi-channel pipetter, add 50ul of the Lipofectamine/Optimem I 
mixture to each well. Pipette up and down gently to mix. Incubate at RT 1 5-45 
minutes. After about 20 minutes, use a multi-channel pipetter to add 150ul 
Optimem I to each well. As a control, one plate of vector DNA lacking an insert 
should be transfected with each set of transfections. 

Preferably, the transfection should be performed by tag-teaming the 
following tasks. By tag-teaming, hands on time is cut in half, and the cells do not 
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spend too much time on PBS. First, person A aspirates off the media from four 
24-well plates of cells, and then person B rinses each well with .5-lml PBS. 
Person A then aspirates off PBS rinse, and person B, using a!2-channel pipetter 
with tips on every other channel, adds the 200ul of DNA/Lipofectamine/Optimem 
5 I complex to the odd wells first, then to the even wells, to each row on the 24-well 

plates. Incubate at 37 degree C for 6 hours. 

While cells are incubating, prepare appropriate media, either 1%BSA in 
DMEM with lx penstrep, or HGS CHO-5 media (1 16.6 mg/L of CaCl 2 (anhyd); 
0.00130 mg/L CuS0 4 -5H 2 0; 0.050 mg/L of Fe(N0 3 )3-9H 2 0; 0.417 mg/L of 

10 FeS0 4 -7H 2 0; 3 1 1 .80 mg/L of Kcl; 28.64 mg/L of MgCl 2 ; 48.84 mg/L of MgS0 4 ; 

6995.50 mg/L of NaCl; 2400.0 mg/L of NaHC0 3 ; 62.50 mg/L of NaH 2 PO 4 -H 2 0; 
71 .02 mg/L of NajHPC^; .4320 mg/L of ZnS0 4 -7H 2 0; .002 mg/L of Arachidonic 
Acid; 1.022 mg/L of Cholesterol; .070 mg/L of DL-alpha-Tocopherol- Acetate; 
0.0520 mg/L of Linoleic Acid; 0.010 mg/L of Linolenic Acid; 0.010 mg/L of 

15 Myristic Acid; 0.010 mg/L of Oleic Acid; 0.010 mg/L of Palmitric Acid; 0.010 

mg/L of Palmitic Acid; 100 mg/L of Pluronic F-68; 0.010 mg/L of Stearic Acid; 
2.20 mg/L of Tween 80; 4551 mg/L of D-Glucose; 130.85 mg/ml of L- Alanine; 
147.50 mg/ml of L-Arginine-HCL; 7.50 mg/ml of L-Asparagine-H 2 0; 6.65 mg/ml 
of L-Aspartic Acid; 29.56 mg/ml of L-Cystme-2HCL-H 2 0; 31.29 mg/ml of L- 

20 Cystine-2HCL; 7.35 mg/ml of L-Ghitamic Acid; 365.0 mg/ml of L-Glutamine; 

18.75 mg/ml of Glycine; 52.48 mg/ml of L-Histidine-HCL-H 2 0; 106.97 mg/ml of 
L-Isoleucine; 11 1 .45 mg/ml of L-Leucine; 163.75 mg/ml of L-LysineHCL; 32.34 
mg/ml of L-Methionine; 68.48 mg/ml of L-Phenylalainine; 40.0 mg/ml of L- 
Proline; 26.25 mg/ml of L-Serine; 101.05 mg/ml ofL-Threonine; 19.22 mg/ml of 

25 L-Tryptophan; 91.79 mg/ml of L-Tryrosine-2Na-2H 2 0; and 99.65 mg/ml of L- 

Valine; 0.0035 mg/L of Biotin; 3.24 mg/L of D-Ca Pantothenate; 1 1 .78 mg/L of 
Choline Chloride; 4.65 mg/L of Folic Acid; 15.60 mg/L of i-Inositol; 3 .02 mg/L 
of Niacinamide; 3.00 mg/L of Pyridoxal HCL; 0.031 mg/L of Pyridoxine HCL; 
0.319 mg/L of Riboflavin; 3.17 mg/L of Thiamine HCL; 0.365 mg/L of 

30 Thymidine; 0.680 mg/L of Vitamin B 12 ; 25 mM of HEPES Buffer; 2.39 mg/L of 
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Na Hypoxanthine; 0. 1 05 mg/L of Lipoic Acid; 0.081 mg/L of Sodium Putrescine- 
2HCL; 55.0 mg/L of Sodium Pyruvate; 0.0067 mg/L of Sodium Selenite; 20uM 
of Ethanolamine; 0.122 mg/L of Ferric Citrate; 41.70 mg/L of Methyl-B- 
Cyclodextrin complexed with Linoleic Acid; 33.33 mg/L of Methyl-B- 
Cyclodextrin complexed with Oleic Acid; 10 mg/L of Methyl-B-Cyclodextrin 
complexed with Retinal Acetate. Adjust osmolality to 327 mOsm) with 2mm 
gfutamine and lx penstrep. (BSA (81-068-3 Bayer) lOOgm dissolved in 1L 
DMEM for a 10% BSA stock solution). Filter the media and collect 50 ul for 
endotoxin assay in 15ml polystyrene conical. 

The transfection reaction is terminated, preferably by tag-teaming, at the 
end of the incubation period. Person A aspirates off the transfection media, while 
person B adds 1.5ml appropriate media to each well. Incubate at 37 degree C for 
45 or 72 hours depending on the media used: 1%BSA for 45 hours or CHO-5 
for 72 hours. 

On day four, using a 3 OOul multichannel pipetter, aliquot 600ul in one 1 ml 
deep well plate and the remaining supernatant into a 2ml deep well. The 
supernatants from each well can then be used in the assays described in Examples 
16-23. 

It is specifically understood that when activity is obtained in any of the 
assays described below using a supernatant, the activity originates from either the 
METH1 or METH2 polypeptide directly (e.g., as a secreted protein) or by 
METH1 or METH2 inducing expression of other proteins, which are then 
secreted into the supernatant. Thus, the invention further provides a method of 
identifying the protein in the supernatant characterized by an activity in a particular 
assay. 

Example 15: Construction of GAS Reporter Construct 

One signal transduction pathway involved in the differentiation and 
proliferation of cells is called the Jaks-STATs pathway. Activated proteins in the 
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Jaks-STATs pathway bind to gamma activation site "GAS" elements or interferon- 
sensitive responsive element ("ISRE"), located in the promoter of many genes. 
The binding of a protein to these elements alter the expression of the associated 
gene. 

GAS and ISRE elements are recognized by a class of transcription factors 
called Signal Transducers and Activators ofTranscription, or " STATs. " There are 
six members of the STATs family. Statl and Stat3 are present in many cell types, 
as is Stat2 (as response to IFN-alpha is widespread). Stat4 is more restricted and 
is not in many cell types though it has been found in T helper class I, cells after 
treatment with IL- 1 2. StatS was originally called mammary growth factor, but has 
been found at higher concentrations in other cells including myeloid cells. It can 
be activated in tissue culture cells by many cytokines. 

The STATs are activated to translocate from the cytoplasm to the nucleus 
upon tyrosine phosphorylation by a set of kinases known as the Janus Kinase 
("Jaks") family. Jaks represent a distinct family of soluble tyrosine kinases and 
include Tyk2, Jakl, Jak2, and Jak3. These kinases display significant sequence 
similarity and are generally catalytically inactive in resting cells. 

The Jaks are activated by a wide range of receptors summarized in the 
Table below. (Adapted from review by Schidler and Darnell, Ann. Rev. Biochem. 
64:621-51 (1995).) A cytokine receptor family, capable of activating Jaks, is 
divided into two groups: (a) Class 1 includes receptors for IL-2, IL-3, IL-4, IL-6, 
IL-7, DL-9, IL-1 1, IL-12, IL-15, Epo, PRL, GH, G-CSF, GM-CSF, LIF, CNTF, 
and thrombopoietin; and (b) Class 2 includes IFN-a, IFN-g, and IL- 1 0. The Class 
1 receptors share a conserved cysteine motif (a set of four conserved cysteines and 
one tryptophan) and a WSXWS motif (a membrane proxial region encoding Trp- 
Ser-Xxx-Trp-Ser (SEQ ID NO:82)). 

Thus, on binding of a ligand to a receptor, Jaks are activated, which in turn 
activate STATs, which then translocate and bind to GAS elements. This entire 
process is encompassed in the Jaks-STATs signal transduction pathway. 
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Therefore, activation of the Jaks-STATs pathway, reflected by the binding 
of the GAS or the 1SRE element, can be used to indicate proteins involved in the 
proliferation and differentiation of cells. For example, growth factors and 
cytokines are known to activate the Jaks-STATs pathway. (See Table below.) 
5 Thus, by using GAS elements linked to reporter molecules, activators of the Jaks- 

STATs pathway can be identified. 
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To construct a synthetic GAS containing promoter element, which is used 
in the Biological Assays described in Examples 16-17, a PCR based strategy is 
employed to generate a GAS-SV40 promoter sequence. The 5* primer contains 
four tandem copies of the GAS binding site found in the IRF1 promoter and 
previously demonstrated to bind STATs upon induction with a range of cytokines 
(Rothman et aL, Immunity 7:457-468 (1994).), although other GAS or ISRE 
elements can be used instead. The 5' primer also contains 18bp of sequence 
complementary to the S V40 early promoter sequence and is flanked with an Xhol 
site. The sequence of the 5* primer is: 

5^GCGCCTCGAGATTTCCCCGAAATCTAGATTTCCCCGAAATGATTT 
CCCCGAAATGATT^CCCCGAAATATCTGCCATCTCAATTAG:3 , (SEQID 
NO:86) 

The downstream primer is complementary to the SV40 promoter and is 
flanked with a Hind III site: 5 ' : GC GGC AAGCTTTTTGC AAAGCCT AGGC : 3 ' 
(SEQ ID NO:87) 

PCR amplification is performed using the S V40 promoter template present 
in the B-gal:promoter plasmid obtained from Clontech. The resulting PCR 
fragment is digested with Xhol/Hind III and subcloned into BLSK2- 
(Stratagene.) Sequencing with forward and reverse primers confirms that the 
insert contains the following sequence: 

5' : CTCGAG ATTTCCCCG AAATCT AGATTTCCCCGAAATGATTTCCCC 
GAAATGATTTCCCCGAAATATCTGCCATCTCAATTAGTCAGCAACCA 
TAGTCCCGCCCCTAACTCCGCCCATCCCGCCCCTAACTCCGCCCAGT 
TCCGCCCATTCTCCGCCCCATGGCTGACTAATTTTTTTTATTTATGCA 
GAGGCCGAGGCCGCCTCGGCCTCTGAGCTATTCCAGAAGTAGTGAG 
GAGGCTTTTTTGGAGGCCTAGGCTTTTGCAA AAAGCTT :3 ' (SEQ ID 
NO:88) 

With this GAS promoter element linked to the SV40 promoter, a 
GAS:SEAP2 reporter construct is next engineered. Here, the reporter molecule 
is a secreted alkaline phosphatase, or "SEAP." Clearly, however, any reporter 



WO 99/37660 



PCT/US99/01313 



-152- 

molecule can be instead of SEAP, in this or in any of the other Examples. Well 
known reporter molecules that can be used instead of SEAP include 
chloramphenicol acetyltransferase (CAT), luciferase, alkaline phosphatase, B- 
galactosidase, green fluorescent protein (GFP), or any protein detectable by an 
antibody. 

The above sequence confirmed synthetic GAS-SV40 promoter element is 
subcloned into the pSE AP-Promoter vector obtained from Clontech using Hindffl 
and Xhol, effectively replacing the S V40 promoter with the amplified GAS : S V40 
promoter element, to create the GAS-SEAP vector. However, this vector does 
not contain a neomycin resistance gene, and therefore, is not preferred for 
mammalian expression systems. 

Thus, in order to generate mammalian stable cell lines expressing the GAS- 
SEAP reporter, the GAS-SEAP cassette is removed from the GAS-SEAP vector 
using Sail and Notl, and inserted into a backbone vector containing the neomycin 
resistance gene, such as pGFP-1 (Clontech), using these restriction sites in the 
multiple cloning site, to create the GAS-SEAP/Neo vector. Once this vector is 
transfected into mammalian cells, this vector can then be used as a reporter 
molecule for GAS binding as described in Examples 16-17. 

Other constructs can be made using the above description and replacing 
GAS with a different promoter sequence. For example, construction of reporter 
molecules containing NFK-B and EGR promoter sequences are described in 
Examples 1 8 and 19. However, many other promoters can be substituted using 
the protocols described in these Examples. For instance, SRE, EL-2, NFAT, or 
Osteocalcin promoters can be substituted, alone or in combination (e.g., 
GAS/NF-KB/EGR, GAS/NF-KB, I1-2/NFAT, or NF-KB/GAS). Similarly, other 
cell lines can be used to test reporter construct activity, such as HELA (epithelial), 
HUVEC (endothelial), Reh (B-cell), Saos-2 (osteoblast), HUVAC (aortic), or 
Cardiomyocyte. 
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Example 16: High-Throughput Screening Assay for T-cell Activity 

The following protocol is used to assess T-cell activity of METH1 or 
METH2 by determining whether METH1 or METH2 supernatant proliferates 
and/or differentiates T-cells. T-cell activity is assessed using the GAS/SEAP/Neo 
construct produced in Example 15. Thus, factors that increase SEAP activity 
indicate the ability to activate the Jaks-STATS signal transduction pathway. The 
T-cell used in this assay is Jurkat T-cells (ATCC Accession No. TIB- 152), 
although Molt-3 cells (ATCC Accession No. CRL-1 552) and Molt-4 cells (ATCC 
Accession No. CRL-1 582) cells can also be used. 

Jurkat T-cells are lymphoblastic CD4+ Thl helper cells. In order to 
generate stable cell lines, approximately 2 million Jurkat cells are transfected with 
the GAS-SEAP/neo vector using DMRIE-C (Life Technologies)(transfection 
procedure described below). The transfected cells are seeded to a density of 
approximately 20,000 cells per well and transfectants resistant to 1 mg/ml genticin 
selected. Resistant colonies are expanded and then tested for their response to 
increasing concentrations of interferon gamma. The dose response of a selected 
clone is demonstrated. 

Specifically, the following protocol will yield sufficient cells for 75 wells 
containing 200 ul of cells. Thus, it is either scaled up, or performed in multiple to 
generate sufficient cells for multiple 96 well plates. Jurkat cells are maintained in 
RPMI + 10% serum with l%Pen-Strep. Combine 2.5 mis of OPTI-MEM (Life 
Technologies) with 10 ug of plasmid DNA in a T25 flask. Add 2.5 ml OPTI- 
MEM containing 50 ul of DMRIE-C and incubate at room temperature for 1 5-45 
mins. 

During the incubation period, count cell concentration, spin down the 
required number of cells (10 7 per transfection), and resuspend in OPTI-MEM to 
a final concentration of 1 0 7 cells/ml. Then add 1 ml of 1 x 1 0 7 cells in OPTI-MEM 
to T25 flask and incubate at 37 degree C for 6 hrs. After the incubation, add 10 
ml of RPMI + 15% serum. 
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The Jurkat:GAS-SEAP stable reporter lines are maintained in RPMI + 
10% serum, 1 mg/ml Genticin, and \% Pen-Strep. These cells are treated with 
. supernatants containing METH 1 or METH2 polypeptides or METH 1 or METH2 
induced polypeptides as produced by the protocol described in Example 14. 

On the day of treatment with the supernatant, the cells should be washed 
and resuspended in fresh RPMI + 1 0% serum to a density of 500,000 cells per ml. 
The exact number of cells required will depend on the number of supernatants 
being screened. For one 96 well plate, approximately 10 million cells (for 10 
plates, 100 million cells) are required. 

Transfer the cells to a triangular reservoir boat, in order to dispense the 
cells into a 96 well dish, using a 12 channel pipette. Using a 12 channel pipette, 
transfer 200 ul of cells into each well (therefore adding 100,000 cells per well). 

After all the plates have been seeded, 50 ul of the supernatants are 
transferred directly from the 96 well plate containing the supernatants into each 
well using a 12 channel pipette. In addition, a dose of exogenous interferon 
gamma (0. 1 , 1 .0, 1 0 ng) is added to wells H9, H10, and HI 1 to serve as additional 
positive controls for the assay. 

The 96 well dishes containing Jurkat cells treated with supernatants are 
placed in an incubator for 48 hrs (note: this time is variable between 48-72 hrs). 
35 ul samples from each well are then transferred to an opaque 96 well plate using 
a 12 channel pipette. The opaque plates should be covered (using sellophene 
covers) and stored at -20 degree C until SEAP assays are performed according to 
Example 20. The plates containing the remaining treated cells are placed at 4 
degree C and serve as a source of material for repeating the assay on a specific 
well if desired. 

As a positive control, 1 00 Unit/ml interferon gamma can be used which is 
known to activate Jurkat T cells. Over 30 fold induction is typically observed in 
the positive control wells. 
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Example 1 7: High-Throughput Screening Assay Identifying Myeloid Activity 

The following protocol is used to assess myeloid activity of METH1 or 
METH2 by determining whether METH1 or METH2 proliferates and/or 
differentiates myeloid cells. Myeloid cell activity is assessed using the 
GAS/SEAP/Neo construct produced in Example 15. Thus, factors that increase 
SEAP activity indicate the ability to activate the Jaks-ST ATS signal transduction 
pathway. The myeloid cell used in this assay is U937, a pre-monocyte cell line, 
although TF-1, HL60, or KG1 can be used. 

To transiently transfect U937 cells with the GAS/SEAP/Neo construct 
produced in Example 15, aDEAE-Dextran method (Kharbandae/. a!., 1 994, Cell 
Growth & Differentiation 5:259-265) is used. First, harvest 2xl0e 7 U937 cells 
and wash with PBS. The U937 cells are usually grown in RPMI 1640 medium 
containing 1 0% heat-inactivated fetal bovine serum (FBS) supplemented with 1 00 
units/ml penicillin and 100 mg/ml streptomycin. 

Next, suspend the cells in 1 ml of 20 mM Tris-HCl (pH 7.4) buffer 
containing 0.5 mg/ml DEAE-Dextran, 8 ug GAS-SEAP2 plasmid DNA, 140 mM 
NaCl, 5 mM KC1, 375 uM Na 2 HP0 4 *7H 2 0, 1 mM MgCl 2) and 675 uM CaCl 2 . 
Incubate at 37 degree C for 45 min. 

Wash the cells with RPMI 1640 medium containing 10% FBS and then 
resuspend in 10 ml complete medium and incubate at 37 degree C for 36 hr. 

The GAS-SEAP/U937 stable cells are obtained by growing the cells in 400 
ug/ml G41 8. The G41 8-free medium is used for routine growth but every one to 
two months, the cells should be re-grown in 400 ug/ml G418 for couple of 
passages. 

These cells are tested by harvesting 1x1 0 8 cells (this is enough for ten 96- 
well plates assay) and wash with PBS. Suspend the cells in 200 ml above 
described growth medium, with a final density of 5x1 0 s cells/ml. Plate 200 ul cells 
per well in the 96-well plate (or lxl 0 5 cells/well). 
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Add 50 ul of the supernatant prepared by the protocol described in 
Example 14. Incubate at 37 degee C for 48 to 72 hr. As a positive control, 100 
Unit/ml interferon gamma can be used which is known to activate U937 cells. 
Over 30 fold induction is typically observed in the positive control wells. SEAP 
5 assay the supernatant according to the protocol described in Example 20. 

Example 1 8: High- Throughput Screening Assay Identifying Neuronal Activity 

When cells undergo differentiation and proliferation, a group of genes are 
activated through many different signal transduction pathways. One of these 
genes, EGR1 (early growth response gene 1), is induced in various tissues and cell 

1 0 types upon activation. The promoter of EGR1 is responsible for such induction. 

Using the EGR1 promoter linked to reporter molecules, activation of cells can be 
assessed by METH1 or METH2. 

Particularly, the following protocol is used to assess neuronal activity in 
PC 12 cell lines. PC 12 cells (rat phenochromocytoma cells) are known to 

1 5 proliferate and/or differentiate by activation with a number of mitogens, such as 

TPA (tetradecanoyl phorbol acetate), NGF (nerve growth factor), and EGF 
(epidermal growth factor). The EGR1 gene expression is activated during this 
treatment. Thus, by stably transfecting PC 12 cells with a construct containing an 
EGR promoter linked to SEAP reporter, activation of PCI 2 cells by METH1 or 

20 METH2 can be assessed. 

The EGR/SEAP reporter construct can be assembled by the following 
protocol. The EGR-1 promoter sequence (-633 to +l)(Sakamoto K et al. y 
Oncogene 6:867-871 (1991)) can be PCR amplified from human genomic DNA 
using the following primers: 

25 5' GCGCTCGAGGGATGACAGCGATAGAACCCCGG -3' (SEQ ID 

NO:89) 

5' GCGAAGCTTCGCGACTCCCCGGATCCGCCTC-3 ' (SEQ ID 

NO:90) 
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Using the GAS:SEAP/Neo vector produced in Example 15, EGR1 
amplified product can then be inserted into this vector. Linearize the 
GASSEAP/Neo vector using restriction enzymes Xhol/Hindlll, removing the 
GAS/SV40 staffer. Restrict the EGR1 amplified product with these same 
enzymes. Ligate the vector and the EGR1 promoter. 

To prepare 96 well-plates for cell culture, two mis of a coating solution 
(1:30 dilution of collagen type I (Upstate Biotech Inc. Cat#08-115) in 30% 
ethanol (filter sterilized)) is added per one 1 0 cm plate or 50 ml per well of the 96- 
well plate, and allowed to air dry for 2 hr. 

PC 12 cells are routinely grown in RPMI-1640 medium (Bio Whittaker) 
containing 10% horse serum (JRH BIOSCIENCES, Cat. # 12449-78P), 5% heat- 
inactivated fetal bovine serum (FBS) supplemented with 100 units/ml penicillin 
and 1 00 ug/ml streptomycin on a precoated 1 0 cm tissue culture dish. One to four 
split is done every three to four days. Cells are removed from the plates by 
scraping and resuspended with pipetting up and down for more than 15 times. 

Transfect the EGR/SEAP/Neo construct into PC 12 using the 
Lipofectamine protocol described in Example 14. EGR-SEAP/PC12 stable cells 
are obtained by growing the cells in 3 00 ug/ml G4 1 8 . The G4 1 8-free medium is 
used for routine growth but every one to two months, the cells should be re- 
grown in 300 ug/ml G418 for couple of passages. 

To assay for neuronal activity, a 1 0 cm plate with cells around 70 to 80% 
confluent is screened by removing the old medium. Wash the cells once with PBS 
(Phosphate buffered saline). Then starve the cells in low serum medium (RPMI- 
1640 containing 1% horse serum and 0.5% FBS with antibiotics) overnight. 

The next morning, remove the medium and wash the cells with PBS. 
Scrape off the cells from the plate, suspend the cells well in 2 ml low serum 
medium. Count the cell number and add more low serum medium to reach final 
cell density as 5x1 0 5 cells/ml. 

Add 200 ul of the cell suspension to each well of 96-well plate (equivalent 
to 1x10 s cells/well). Add 50 ul supernatant produced by Example 14, 37 degree 



WO 99/37660 



PCTAJS99/01313 



-158- 

C for 48 to 72 hr. As a positive control, a growth factor known to activate PCI 2 
cells through EGR can be used, such as 50 ng/ul of Neuronal Growth Factor 
(NGF). Over fifty-fold induction of SEAP is typically seen in the positive control 
wells. SEAP assay the supernatant according to Example 20. 

Example 19: High-Throughput Screening Assay for T-cell Activity 

NF-KB (Nuclear Factor KB) is a transcription factor activated by a wide 
variety of agents including the inflammatory cytokines DL-1 and TNF, CD30 and 
CD40, lymphotoxin-alpha and lymphotoxin-beta, by exposure to LPS or thrombin, 
and by expression of certain viral gene products. As a transcription factor, NF-KB 
regulates the expression of genes involved in immune cell activation, control of 
apoptosis (NF- KB appears to shield cells from apoptosis), B and T-cell 
development, anti-viral and antimicrobial responses, and multiple stress responses. 

In non-stimulated conditions, NF- KB is retained in the cytoplasm with I- 
KB (Inhibitor KB). However, upon stimulation, I- KB is phosphorylated and 
degraded, causing NF- KB to shuttle to the nucleus, thereby activating 
transcription of target genes. Target genes activated by NF- KB include DL-2, 
IL-6, GM-CSF, ICAM-1 and class 1 MHC. 

Due to its central role and ability to respond to a range of stimuli, reporter 
constructs utilizing the NF-KB promoter element are used to screen the 
supernatants produced in Example 14. Activators or inhibitors of NF-KB would 
be useful in treating diseases. For example, inhibitors of NF-KB could be used to 
treat those diseases related to the acute or chronic activation of NF-KB, such as 
rheumatoid arthritis. 

To construct a vector containing the NF-KB promoter element, a PCR 
based strategy is employed. The upstream primer contains four tandem copies of 
the NF-KB binding site (GGGGACTTTCCC) (SEQ ID NO:91), 18 bp of 
sequence complementary to the 5' end of the SV40 early promoter sequence, and 
is flanked with an Xhol site: 
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5 1 : GCGGCCTCGAGGGGACTTTCCCGGGGACTTTCCGGGG ACTTTCCG 
GGACTTTCC ATCCTGCC ATCTC AATT AG: 3 ' (SEQ ID NO:92) 

The downstream primer is complementary to the 3' end of the SV40 
promoter and is flanked with a Hind HI site: 

5 ' : GC GGC AAGCTTTTTGC AAAGCCTAGGC : 3 ' (SEQ ID NO:93) 

PCR amplification i s performed using the S V40 promoter template present 
in the pB-gal:promoter plasmid obtained from Clontech. The resulting PCR 
fragment is digested with Xhol and Hind III and subcloned into BLSK2-. 
(Stratagene) Sequencing with the T7 and T3 primers confirms the insert contains 
the following sequence: 

S^CTCGAGGGGACTTTCCCGGGGACTTTCCGGGGACTTTCCGGGACT 
TTCCATCTGCCATCTCAATTAGTCAGCAACCATAGTCCCGCCCCTAA 
CTCCGCCCATCCCGCCCCTAACTCCGCCCAGTTCCGCCCATTCTCCG 
CCCCATGGCTGACTAATTTTTTTTATTTATGCAGAGGCCGAGGCCGC 
CTCGGCCTCTGAGCTATTCCAGAAGTAGTGAGGAGGCTTTTTTGGA 
GGCCTAGGCTTTTGCAAAAAGCTT:3 , (SEQ ID NO:88) 

Next, replace the S V40 minimal promoter element present in the pSE AP2- 
promoter plasmid (Clontech) with this NF-KB/SV40 fragment using Xhol and 
Hindlll. However, this vector does not contain a neomycin resistance gene, and 
therefore, is not preferred for mammalian expression systems. 

In order to generate stable mammalian cell lines, the NF-KB/S V40/SEAP 
cassette is removed from the above NF-KB/SEAP vector using restriction 
enzymes Sail and NotI, and inserted into a vector containing neomycin resistance. 
Particularly, the NF-KB/SV40/SEAP cassette was inserted into pGFP-1 
(Clontech), replacing the GFP gene, after restricting pGFP-1 with Sail and Notl. 

Once NF-KB/S V40/SEAP/Neo vector is created, stable Jurkat T-cells are 
created and maintained according to the protocol described in Example 16. 
Similarly, the method for assaying supernatants with these stable Jurkat T-cells is 
also described in Example 1 6. As a positive control, exogenous TNF alpha (0.1,1, 
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10 ng) is added to wells H9, H10, and HI 1, with a 5-10 fold activation typically 
observed. 

Example 20: Assay for SEAP Activity 

As a reporter molecule for the assays described in Examples 16-19, SEAP 
5 activity is assayed using the Tropix Phospho-light Kit (Cat. BP-400) according to 

the following general procedure. The Tropix Phospho-light Kit supplies the 
Dilution, Assay, and Reaction Buffers used below. 

Prime a dispenser with the 2. 5x Dilution Buffer and dispense 1 5 ul of 2. 5x 
dilution buffer into Optiplates containing 35 ul of a supernatant. Seal the plates 
10 with a plastic sealer and incubate at 65 degree C for 30 min. Separate the 

Optiplates to avoid uneven heating. 

Cool the samples to room temperature for 15 minutes. Empty the 
dispenser and prime with the Assay Buffer. Add 50 ml Assay Buffer and incubate 
at room temperature 5 min. Empty the dispenser and prime with the Reaction 
15 Buffer (see the table below). Add 50 ul Reaction Buffer and incubate at room 

temperature for 20 minutes. Since the intensity of the chemiluminescent signal is 
time dependent, and it takes about 10 minutes to read 5 plates on luminometer, 
one should treat 5 plates at each time and start the second set 10 minutes later. 
Read the relative light unit in the luminometer. Set H 1 2 as blank, and print 
20 the results. An increase in chemihiminescence indicates reporter activity. 



Reaction Buffer Formulation: 

g of plates Rxn buffer diluent (ml) CSPD (ml) 

10 60 3 

11 65 3.25 
25 12 70 3.5 

13 75 3.75 

14 80 4 

15 85 4.25 

16 90 4.5 
30 17 95 4.75 

18 100 5 
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19 


105 


5.25 


20 


110 


5.5 


21 


115 


5.75 


22 


120 


6 


23 


125 


6.25 


24 


130 


6.5 


25 


135 


6.75 


26 


140 


7 


27 


145 


7.25 


28 


150 


7.5 


29 


155 


7.75 


30 


160 


8 


31 


165 


8.25 


32 


170 


8.5 


33 


175 


8.75 


34 


180 


9 


35 


185 


9.25 


36 


190 


9.5 


37 


195 


9.75 


38 


200 


10 


39 


205 


10.25 


40 


210 


10.5 


41 


215 


10.75 


42 


220 


11 


43 


225 


11.25 


44 


230 


11.5 


45 


235 


11.75 


46 


240 


12 


47 


245 


12.25 


48 


250 


12.5 


49 


255 


12.75 


50 


260 


13 



Example 21: High-Throughput Screening Assay Identifying Changes in Small 
Molecule Concentration and Membrane Permeability 

Binding of a ligand to a receptor is known to alter intracellular levels of 
small molecules, such as calcium, potassium, sodium, and pH, as well as alter 
membrane potential. These alterations can be measured in an assay to identify 
supernatants which bind to receptors of a particular cell. Although the following 
protocol describes an assay for calcium, this protocol can easily be modified to 
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detect changes in potassium, sodium, pH, membrane potential, or any other small 
molecule which is detectable by a fluorescent probe. 

The following assay uses Fluorometric Imaging Plate Reader ("FLIPR") 
to measure changes in fluorescent molecules (Molecular Probes) that bind small 
molecules. Clearly, any fluorescent molecule detecting a small molecule can be 
used instead of the calcium fluorescent molecule, fluo-3, used here. 

For adherent cells, seed the cells at 10,000 -20,000 cells/well in a Co-star 
black 96-well plate with clear bottom. The plate is incubated in a C0 2 incubator 
for 20 hours. The adherent cells are washed two times in Biotek washer with 200 
ul of HBSS (Hank's Balanced Salt Solution) leaving 1 00 ul of buffer after the final 
wash. 

A stock solution of 1 mg/ml fluo-3 is made in 10% pluronic acid DMSO. 
To load the cells with fluo-3, 50 ul of 12 ug/ml fluo-3 is added to each well. The 
plate is incubated at 37 degree C in a C0 2 incubator for 60 min. The plate is 
washed four times in the Biotek washer with HBSS leaving 100 ul of buffer. 

For non-adherent cells, the cells are spun down from culture media. Cells 
are re-suspended to 2-5x1 0 6 cells/ml with HBSS in a 50-ml conical tube. 4 ul of 
1 mg/ml fluo-3 solution in 10% pluronic acid DMSO is added to each ml of cell 
suspension. The tube is then placed in a 37 degree C water bath for 30-60 min. 
The cells are washed twice with HBSS, resuspended to lxlO 6 cells/ml, and 
dispensed into a microplate, 1 00 ul/well. The plate is centrifuged at 1 000 rpm for 
5 min. The plate is then washed once in Denley CellWash with 200 ul, followed 
by an aspiration step to 100 ul final volume. 

For a non-cell based assay, each well contains a fluorescent molecule, such 
as fluo-3. The supernatant is added to the well, and a change in fluorescence is 
detected. 

To measure the fluorescence of intracellular calcium, the FLIPR is set for 
the following parameters: (1) System gain is 300-800 mW; (2) Exposure time 
is 0.4 second; (3) Camera F/stop is F/2; (4) Excitation is 488 nm; (5) Emission is 
530 nm; and (6) Sample addition is 50 ul. Increased emission at 530 nm indicates 
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an extracellular signaling event caused by the a molecule, either METH1 or 
METH2 or a molecule induced by METH1 or METH2, which has resulted in an 
increase in the intracellular Ca** concentration. 

Example 22: High-Throughput Screening Assay Identifying Tyrosine Kinase 
Activity 

The Protein Tyrosine Kinases (PTK) represent a diverse group of 
transmembrane and cytoplasmic kinases. Within the Receptor Protein Tyrosine 
Kinase RPTK) group are receptors for a range of mitogenic and metabolic growth 
factors including the PDGF, FGF, EGF, NGF, HGF and Insulin receptor 
subfamilies. In addition there are a large family of RPTKs for which the 
corresponding ligand is unknown. Ligands for RPTKs include mainly secreted 
small proteins, but also membrane-bound and extracellular matrix proteins. 

Activation of RPTK by ligands involves ligand-mediated receptor 
dimerization, resulting in transphosphorylation of the receptor subunits and 
activation of the cytoplasmic tyrosine kinases. The cytoplasmic tyrosine kinases 
include receptor associated tyrosine kinases of the src-family (e.g., src, yes, Ick, 
lyn, fyn) and non-receptor linked and cytosolic protein tyrosine kinases, such as 
the Jak family, members of which mediate signal transduction triggered by the 
cytokine superfamily of receptors (e.g., the Interleukins, Interferons, GM-CSF, 
and Leptin). 

Because of the wide range ofknown factors capable of stimulating tyrosine 
kinase activity, identifying whether METH1 or METH2 or a molecule induced by 
METH1 or METH2 is capable of activating tyrosine kinase signal transduction 
pathways is of interest. Therefore, the following protocol is designed to identify 
such molecules capable of activating the tyrosine kinase signal transduction 
pathways. 

Seed target cells (e.g., primary keratinocytes) at a density of approximately 
25,000 cells per well in a 96 well Loprodyne Silent Screen Plates purchased from 
Nalge Nunc (Naperville, DL). The plates are sterilized with two 30 minute rinses 
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with 100% ethanol, rinsed with water and dried overnight. Some plates are 
coated for 2 hr with 100 ml of cell culture grade type I collagen (50 mg/ml), 
gelatin (2%) or polylysine (50 mg/ml), all of which can be purchased from Sigma 
Chemicals (St. Louis, MO) or 10% Matrigel purchased from Becton Dickinson 
(Bedford,MA), or calf serum, rinsed with PBS arid stored at 4 degree C. Cell 
growth on these plates is assayed by seeding 5,000 cells/well in growth medium 
and indirect quantitation of cell number through use of alamarBlue as described 
by the manufacturer Alamar Biosciences, Inc. (Sacramento, CA) after 48 hr. 
Falcon plate covers #3071 from Becton Dickinson (Bedford,MA) are used to 
cover the Loprodyne Silent Screen Plates. Falcon Microtest III cell culture plates 
can also be used in some proliferation experiments. 

To prepare extracts, A43 1 cells are seeded onto the nylon membranes of 
Loprodyne plates (20,000/200ml/well) and cultured overnight in complete 
medium. Cells are quiesced by incubation in serum-free basal medium for 24 hr. 
After 5-20 minutes treatment with EGF (60ng/ml) or 50 ul of the supernatant 
produced in Example 14, the medium was removed and 100 ml of extraction 
buffer ((20 mM HEPES pH 7.5, 0.15 M NaCl, 1% Triton X-100, 0.1% SDS, 2 
mM Na3 V04, 2 mM Na4P207 and a cocktail of protease inhibitors (# 1 83 6 1 70) 
obtained from Boehringer Mannheim (Indianapolis, IN) is added to each well and 
the plate is shaken on a rotating shaker for 5 minutes at 4°C. The plate is then 
placed in a vacuum transfer manifold and the extract filtered through the 0.45 mm 
membrane bottoms of each well using house vacuum. Extracts are collected in 
a 96-well catch/assay plate in the bottom of the vacuum manifold and immediately 
placed on ice. To obtain extracts clarified by centrifugation, the content of each 
well, after detergent solubilization for 5 minutes, is removed and centrifuged for 
15 minutes at 4 degree C at 16,000 x g. 

Test the filtered extracts for levels of tyrosine kinase activity. Although 
many methods of detecting tyrosine kinase activity are known, one method is 
described here. 
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Generally, the tyrosine kinase activity of a supernatant is evaluated by 
determining its ability to phosphorylate a tyrosine residue on a specific substrate 
(a biotinylated peptide). Biotinylated peptides that can be used for this purpose 
include PSK1 (corresponding to amino acids 6-20 of the cell division kinase cdc2- 
p34) and PSK2 (corresponding to amino acids 1 -1 7 of gastrin). Both peptides are 
substrates for a range of tyrosine kinases and are available from Boehringer 
Mannheim. 

The tyrosine kinase reaction is set up by adding the following components 
in order. First, add lOul of 5uM Biotinylated Peptide, then lOul ATP/Mg 2+ (5mM 
ATP/50mM MgC12), then lOul of 5x Assay Buffer (40mM imidazole 
hydrochloride, pH7.3, 40 mM beta-glycerophosphate, ImM EGTA, lOOmM 
MgCl 2 , 5 mM MnCl 2 , 0.5 mg/ml BSA), then 5ul of Sodium Vanadate(lmM), and 
then 5ul of water. Mix the components gently and preincubate the reaction mix 
at 30 degree C for 2 min. Initial the reaction by adding 1 Oul of the control enzyme 
or the filtered supernatant. 

The tyrosine kinase assay reaction is then terminated by adding 10 ul of 
1 20mm EDTA and place the reactions on ice. 

Tyrosine kinase activity is determined by transferring 50 ul aliquot of 
reaction mixture to a microtiter plate (MTP) module and incubating at 37 degree 
C for 20 min. This allows the streptavadin coated 96 well plate to associate with 
the biotinylated peptide. Wash the MTP module with 300ul/well of PBS four 
times. Next add 75 ul of anti-phospotyrosine antibody conjugated to horse radish 
peroxidase(anti-P-Tyr-POD(0.5u/ml)) to each well and incubate at 37 degree C 
for one hour. Wash the well as above. 

Next add lOOul of peroxidase substrate solution (Boehringer Mannheim) 
and incubate at room temperature for at least 5 mins (up to 30 min). Measure the 
absoibance of the sample at 405 nm by using ELISA reader. The level of bound 
peroxidase activity is quantitated using an ELISA reader and reflects the level of 
tyrosine kinase activity. 
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Example 23: High- Throughput Screening Assay Identifying Phosphorylation 
Activity 

As a potential alternative and/or compliment to the assay of protein 
tyrosine kinase activity described in Example 22, an assay which detects activation 
(phosphorylation) of major intracellular signal transduction intermediates can also 
be used. For example, as described below one particular assay can detect tyrosine 
phosphorylation of the Erk-1 and Erk-2 kinases. However, phosphorylation of 
other molecules, such as Raf, JNK, p38 MAP, Map kinase kinase (MEK), MEK 
kinase, Src, Muscle specific kinase (MuSK), IRAK, Tec, and Janus, as well as any 
other phosphoserine, phosphotyrosine, or phosphothreonine molecule, can be 
detected by substituting these molecules for Erk- 1 or Erk-2 in the following assay. 

Specifically, assay plates are made by coating the wells of a 96-well ELIS A 
plate with 0. 1ml of protein G (lug/ml) for 2 hr at room temp, (RT). The plates 
are then rinsed with PBS and blocked with 3% BS A/PBS for 1 hr at RT. The 
protein G plates are then treated with 2 commercial monoclonal antibodies 
(lOOng/well) against Erk-1 and Erk-2 (1 hr at RT) (Santa Cruz Biotechnology). 
(To detect other molecules, this step can easily be modified by substituting a 
monoclonal antibody detecting any of the above described molecules.) After 3-5 
rinses with PBS, the plates are stored at 4 degree C until use. 

A43 1 cells are seeded at 20,000/well in a 96-well Loprodyne filterplate and 
cultured overnight in growth medium. The cells are then starved for 48 hr in basal 
medium (DMEM) and then treated with EGF (6ng/well) or 50 ul of the 
supernatants obtained in Example 14 for 5-20 minutes. The cells are then 
solubilized and extracts filtered directly into the assay plate. 

After incubation with the extract for 1 hr at RT, the wells are again rinsed. 
As a positive control, a commercial preparation of MAP kinase (1 Ong/well) is used 
in place 

of A431 extract. Plates are then treated with a commercial polyclonal (rabbit) 
antibody ( 1 ug/ml) which specifically recognizes the phosphorylated epitope of the 
Erk-1 and Erk-2 kinases (1 hr at RT). This antibody is biotinylated by standard 
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procedures. The bound polyclonal antibody is then quantitated by successive 
incubations with Europium-streptavidin and Europium fluorescence enhancing 
reagent in the Wallac DELFIA instrument (time-resolved fluorescence). An 
increased fluorescent signal over background indicates a phosphorylation by 
5 METH1 or METH2 or a molecule induced by METH1 or METH2. 

Example 24: Method of Determining Alterations in the METH1 or METH2 
Gene 

RNA isolated from entire families or individual patients presenting with a 
phenotype of interest (such as a disease) is be isolated. cDNA is then generated 
10 from these RNA samples using protocols known in the art. (See, Sambrook.) 

The cDNA is then used as a template for PCR, employing primers surrounding 
regions of interest in SEQ ID NO:l. Suggested PCR conditions consist of 35 
cycles at 95 degree C for 30 seconds; 60-120 seconds at 52-58 degree C; and 60- 
120 seconds at 70 degree C, using buffer solutions described in Sidransky, D. et 

15 al, Science 252:106 (1991). 

PCR products are then sequenced using primers labeled at their 5' end with 
T4 polynucleotide kinase, employing SequiTherm Polymerase. (Epicentre 
Technologies). The intron-exon borders of selected exons of METH1 orMETH2 
is also determined and genomic PCR products analyzed to confirm the results. 

20 PCR products harboring suspected mutations in METH1 or METH2 is then 

cloned and sequenced to validate the results of the direct sequencing. 

PCR products of METH1 or METH2 are cloned into T-tailed vectors as 
described in Holton, T.A and Graham, M.W., Nucleic Acids Research 19. \ 156 
(1991) and sequenced with T7 polymerase (United States Biochemical). Affected 

25 individuals are identified by mutations in METH1 or METH2 not present in 

unaffected individuals. 

Genomic rearrangements are also observed as a method of determining 
alterations in the METH1 or METH2 gene. Isolated genomic clones are nick- 
translated with digoxigenindeoxy-uridine S'-triphosphate (Boehringer Manheim), 
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and FISH performed as described in Johnson, Cg. etal., Methods Cell Biol. 35:73- 
99 (1 99 1 ). Hybridization with the labeled probe is carried out using a vast excess 
of human cot-1 DNA for specific hybridization to the METH1 or METH2 
genomic locus. 

Chromosomes are counterstained with 4,6-diamino-2-phenylidole and 
propidium iodide, producing a combination of C- and R-bands. Aligned images 
for precise mapping are obtained using a triple-band filter set (Chroma 
Technology, Brattleboro, VT) in combination with a cooled charge-coupled 
device camera (Photometries, Tucson, AZ) and variable excitation wavelength 
filters. (Johnson, Cv. et aL y Genet. Anal Tech Appl 5:75 (1991).) Image 
collection, analysis and chromosomal fractional length measurements are 
performed using the ISee Graphical Program System. (Inovision Corporation, 
Durham, NC.) Chromosome alterations of the genomic region of METH1 or 
METH2 (hybridized by the probe) are identified as insertions, deletions, and 
translocations. These METH1 or METH2 alterations are used as a diagnostic 
marker for an associated disease. 

Example 25: Method of Detecting Abnormal Levels ofMETHl orMETHl in 
a Biological Sample 

METH1 or METH2 polypeptides can be detected in a biological sample, 
and if an increased or decreased level ofMETHl or METH2 is detected, this 
polypeptide is a marker for a particular phenotype. Methods of detection are 
numerous, and thus, it is understood that one skilled in the art can modify the 
following assay to fit their particular needs. 

For example, antibody-sandwich ELISAs are used to detect METH1 or 
METH2 in a sample, preferably a biological sample. Wells of a microtiter plate 
are coated with specific antibodies to METH1 or METH2, at a final concentration 
of 0.2 to 10 ug/ml. The antibodies are either monoclonal or polyclonal and are 
produced by the method described in Example 13. The wells are blocked so that 
non-specific binding ofMETHl or METH2 to the well is reduced. 
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The coated wells are then incubated for > 2 hours at RT with a sample 
containing METH1 or METH2. Preferably, serial dilutions of the sample should 
be used to validate resuhs. The plates are then washed three times with deionized 
or distilled water to remove unbounded METH1 or METH2. 
5 Next, 50 ul of specific antibody-alkaline phosphatase conjugate, at a 

concentration of 25-400 ng, is added and incubated for 2 hours at room 
temperature. The plates are again washed three times with deionized or distilled 
water to remove unbounded conjugate. 

Add 75 ul of 4-methylumbelliferyl phosphate (MUP) or p-nitrophenyl 
10 phosphate (NPP) substrate solution to each well and incubate 1 hour at room 

temperature. Measure the reaction by a microliter plate reader. Prepare a 
standard curve, using serial dilutions of a control sample, and plot METH1 or 
METH2 polypeptide concentration on the X-axis Cog scale) and fluorescence or 
absorbance of the Y-axis (linear scale). Interpolate the concentration of the 
1 5 METH1 or METH2 in the sample using the standard curve. 

Example 26: Formulating a Polypeptide 

The METHl or METH2 composition will be formulated and dosed in a 
fashion consistent with good medical practice, taking into account the clinical 
conditionoftheindividualpatient(especiaUy the side effects of treatment with the 
20 METHl or METH2 polypeptide alone), the site of delivery, the method of 

administration, the scheduling of adrninistration, and other factors known to 
practitioners. The "effective amount" for purposes herein is thus determined by 

such considerations. 

As a general proposition, the total pharmaceutical^ effective amount of 
25 METHl or METH2 administered parenteraUy per dose will be in the range of 

about lug/kg/day to 10 mg/kg/day of patient body weight, although, as noted 
above, this will be subject to therapeutic discretion. More preferably, this dose is 
atleartO.01 mg/kg/day, and most preferably for humans between about 0.01 and 
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1 mg/kg/day for the hormone. If given continuously, METH1 or METH2 is 
typically administered at a dose rate of about 1 ug/kg/hour to about 50 
ug/kg/hour, either by 1-4 injections per day or by continuous subcutaneous 
infusions, for example, using a mini-pump. An intravenous bag solution may also 
5 be employed. The length of treatment needed to observe changes and the interval 

following treatment for responses to occur appears to vary depending on the 
desired effect. 

Pharmaceutical compositions containing METH1 or METH2 are 
admmistered orally, rectally, parenterally, intracistemally, intravaginally, 
10 intraperitoneally, topically (as by powders, ointments, gels, drops or transdermal 

patch), bucally, or as an oral or nasal spray. "Pharmaceutical^ acceptable carrier" 
refers to a non-toxic solid, semisolid or liquid filler, diluent, encapsulating material 
or formulation auxiliary of any type. The term "parenteral" as used herein refers 
to modes of administration which include intravenous, intramuscular, 
15 intraperitoneal, intrasternd, subcutaneous and mtraarucularmjecuona^^ 

METH1 or METH2 is also suitably administered by sustained-release 
systems. Suitable examples of sustained-release compositions include semi- 
permeable polymer matrices in the form of shaped articles, e.g., films, or 
mirocapsules. Sustained-release matrices include polylactides (U.S. Pat. No. 
20 3,773,919, EP 58,481), copolymers of L-glutamic acid and gamma-ethyl-L- 

ghitamate (Sidman, U. et al., Biopolymers 22:547-556 (1983)), poly (2- 
hydroxyethyl methacrylate) (R. Langer etal. , J. Biomed Mater. Res. 15: 1 67-277 
(1981), andR. Langer, Chem. Tech. 72.98-105 (1982)), ethylene vinyl acetate (R 
Langer et al.) or poly-D- (-)-3-hydroxybutyric acid (EP 133,988). Sustained- 
25 release compositions also include liposomally entrapped METH1 or METH2 

polypeptides. Liposomes containing the METH1 or METH2 are prepared by 
methods known per se:DE3,218,121;Epsteinera/.,Proc.^o//. Acad. Sci. USA 
52:3688-3692 (1985); Hwang et al, Proc. Natl. Acad. Sci. USA 77:4030-4034 
(1980); EP 52,322; EP 36,676; EP 88,046; EP 143,949; EP 142,641; Japanese 
30 Pat. Appl. 83-1 18008; U.S. Pat. Nos. 4,485,045 and 4,544,545; and EP 102,324. 
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Ordinarily, the liposomes are of the small (about 200-800 Angstroms) unilamellar 
type in which the lipid content is greater than about 30 mol. percent cholesterol, 
the selected proportion being adjusted for the optimal secreted polypeptide 
therapy. 

For parenteral administration, in one embodiment, METH 1 or METH2 is 
formulated generally by mixing it at the desired degree of purity, in a unit dosage 
injectable form (solution, suspension, or emulsion), with a pharmaceutical* 
acceptable carrier, i.e., one that is non-toxic to recipients at the dosages and 
concentrations employed and is compatible with other ingredients of the 
formulation. For example, the formulation preferably does not include oxidizing 
agents and other compounds that are known to be deleterious to polypeptides. 

Generally, the formulations are prepared by contacting METH1 or 
METH2 uniformly and intimately with liquid carriers or finely divided solid 
carriers or both. Then, if necessary, the product is shaped into the desired 
formulation. Preferably the carrier is a parenteral carrier, more preferably a 
solution that is isotonic with the blood of the recipient. Examples of such carrier 
vehicles include water, saline, Ringer's solution, and dextrose solution. Non- 
aqueous vehicles such as fixed oils and ethyl oleate are also useful herein, as well 
as liposomes. 

The carrier suitably contains minor amounts of additives such as 
substances that enhance isotonicity and chemical stability. Such materials are non- 
toxic to recipients at the dosages and concentrations employed, and include 
buffers such as phosphate, citrate, succinate, acetic acid, and other organic acids 
or their salts; antioxidants such as ascorbic acid; low molecular weight (less than 
about ten residues) polypeptides, e.g., polyarginine or tripeptides; proteins, such 
as serum albumin, gelatin, or immunoglobulins; hydrophilic polymers such as 
polyvinylpyrrolidone; amino acids, such as glycine, glutamic acid, aspartic acid, 
or arginine; monosaccharides, disaccharides, and other carbohydrates including 
cellulose or its derivatives, glucose, manose, or dextrins; chelating agents such as 
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EDTA; sugar alcohols such as mannitol or sorbitol; counterfoils such as sodium; 
and/or nonionic surfactants such as polysorbates, poloxamers, or PEG. 

METH1 or METH2 is typically formulated in such vehicles at a 
concentration of about 0. 1 mg/ml to 100 mg/ml, preferably 1-10 mg/ml, at a pH 
of about 3 to 8. It will be understood that the use of certain of the foregoing 
excipients, carriers, or stabilizers will result in the formation of polypeptide salts. 

METH1 or METH2 used for therapeutic administration can be sterile. 
Sterility is readily accomplished by filtration through sterile filtration membranes 
(e g., 0.2 micron membranes). Therapeutic polypeptide compositions generally 
are placed into a container having a sterile access port, for example, an 
intravenous solution bag or vial having a stopper pierceable by a hypodermic 
injection needle. 

METH1 or METH2 polypeptides ordinarily will be stored in unit or multi- 
dose containers, for example, sealed ampoules or vials, as an aqueous solution or 

15 as a lyophilized formulation for reconstitution. As an example of a lyophilized 

formulation, 10-ml vials are filled with 5 ml of sterile-filtered 1% (w/v) aqueous 
METH1 or METH2 polypeptide solution, and the resulting mixture is lyophilized. 
The infusion solution is prepared by reconstituting the lyophilized METH1 or 
METH2 polypeptide using bacteriostatic Water-for-Injection. 

20 The invention also provides a pharmaceutical pack or kit comprising one 

ormorecontainersfiUedwithoneormoreofthemgredientsofthepharmaceutical 

compositions of the invention. Associated with such containers) can be a notice 
in the form prescribed by a governmental agency regulating the manufacture, use 
or sale of pharmaceuticals or biological products, which notice reflects approval 
25 by the agency of manufacture, use or sale for human administration. In addition, 

METH1 or METH2 may be employed in conjunction with other therapeutic 
compounds. 
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Example 27: Method of Treating Decreased Levels ofMETHl or METH2 

The present invention relates to a method for treating an individual in need 
of a decreased level of METH1 or METH2 activity in the body comprising, 
administering to such an individual a composition comprising a therapeutically 
5 effectiveamountofMETHl or METH2 antagonist. Preferred antagonists for use 

in the present invention are METH1 or METH2-specific antibodies. 

Moreover, it will be appreciated that conditions caused by a decrease in 
the standard or normal expression level ofMETHl or METH2 in an individual can 
be treated by adininistering METH1 or METH2, preferably in the secreted form. 
10 Thus, the invention also provides a method of treatment of an individual in need 

of an increased level ofMETHl or METH2 polypeptide comprising administering 
to such an individual a pharmaceutical composition comprising an amount of 
METH1 or METH2 to increase the activity level ofMETHl or METH2 in such 
an individual. 

15 For example, a patient with decreased levels of METH1 or METH2 

polypeptide receives a daily dose 0.1-100 ug/kg of the polypeptide for six 
consecutive days. Preferably, the polypeptide is in the secreted form. The exact 
details of the dosing scheme, based on administration and formulation, are 
provided in Example 26. 

20 Example 28: Method of Treating Increased Levels ofMETHl or METH2 

The present invention also relates to a method for treating an individual in 
need of an increased level ofMETHl or METH2 activity in the body comprising 
administering to such an individual a composition comprising a therapeutically 
effective amount ofMETHl or METH2 or an agonist thereof. 
25 Antisense technology is used to inhibit production ofMETHl or METH2. 

This technology is one example of a method of decreasing levels ofMETHl or 
METH2 polypeptide, preferably a secreted form, due to a variety of etiologies, 
such as cancer. 
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cctgctttga agaagaaaat tgaatgaaat 
tataagatag caaactatat ccaagttgtg 
taccttcaac tacagcatat tattctaggt 
gtgaaccctg cttccattga cctaggccag 
cttacagata gctatatgaa ctgttgaaaa 
gacaaggaga ctacttgtag cctgggagga 
tttaaccacc atggtgcatg ttgtctccac 
actctacctc tgtctgtgtg caagggcagt 
actccaaaaa gaagtttgat aagtgtggcg 
agatgtcagg aatagtcact agtacaaggt 
cacgctgtgt tgctcagttg gtccctagca 
atgtacctgt agtcccagtg tatgggagac 
tcaggtcatc tttgctacat agtgactttc 
ctggaaacac aaacacacac aaaacaatta 
agtaggctga cttgggagga tggttgctat 
agttccagta taacctggcc cacaagtgaa 
aaaacacatg ctttggagac cctgtaattt 
gtcacattta aaagaattgc ctatttttaa 
catatacctg ttttctttgt ccacagacct 
ggagccacca acattgaagt gaaacatcgg 
tttctggcta ttagagccgc tgatggtacc 
acactagagc aagacctcac ctacaaaggt 
gcgctggaga gaatccgcag ctttagtcca 
atggtaggcc atgctctccg acccaaaatt 
gagtcattca acgccattcc cacattttct 
tccaagacat gcggctcagg ttggcagaga 



ttgcttaagc ttgtcatgta ttcttagcat 594 0 
gatgaagtat ttagcaagtg atttataaag 6000 
actgaccatg gaacaataat cagtgtgaca 6060 
caaatatata aaatcaagac atttataagc 6120 
agccaaaatg aaagtgaaca tgtggcacgt 6180 
gagcattccc agttgccatc acatcagatg 624 0 
aggttgtaga tggcactccc tgtagtccag 6300 
gtgtgaaagc tggctgtgat cgcatcatag 6360 
tttgtggagg aaacggttcc acatgcaaga 6420 
gagtttcaga acgctcactt ctgcagtaga 6480 
tctacaagac cttgggttca atccgcatgc 6540 
agagacaagt gtgacaagac ggtcagatgt 6600 
agttcacctt ggggaacatg aaaaacctga 6660 
acccaggtac ttcatgtaat cccagtgttc 6720 
aaggcctagg ttagcttggt ctacataatg 6780 
ccctaaagtt aattaatcga cacatgaaac 6840 
tgatatacga ttttgtagga ctaaggaaaa 6900 
agcaatgtga ttgattaact cattgaaaga 6960 
gggtatcatg acattgtcac aattcctgct 7020 
aatcaaaggg ggtccagaaa caatggcagc 7080 
tatattctga atggaaactt cactctgtcc 7140 
actgtcttaa ggtacagtgg ttcctcggct 7200 
ctcaaagaac ccttaaccat ccaggttctt 7260 
aaattcacct actttatgaa gaagaagaca 7320 
gagtgggtga ttgaagagtg gggggagtgc 7380 
agagtagtgc agtgcagaga cattaatgga 7440 
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For example, a patient diagnosed with abnormally increased levels of 
METH1 or METH2 is administered intravenously antisense polynucleotides at 
05 1 0 1 5 2 0 and 3.0 mg/kg day for 21 days. This treatment is repeated after 
a 7-day rest period if the treatment was well tolerated. The formulation of the 
antisense polynucleotide is provided in Example 26. 

Example 29: Method of Treatment Using Gene Therapy - Ex Vm> 

One method of gene therapy transplants fibroblasts, which are capable of 
expressing METH1 or METH2 polypeptides, onto a patient. Generally, 
fibroblasts are obtained from a subject by skin biopsy. The resulting tissue » 
placedmtissue-culmremediumandseparatedmtosmallpiece, Small chunks of 
the tissue are placed on a wet surface of a tissue culture flask, approximately ten 
pieces are placed in each flask. The flask is turned upside down, closed tight and 
left at room temperature over night. After 24 hours at room temperature, the 
flask is inverted and the chunks of tissue remain fixed to the bottom of the flask 
and fresh media (e.g., Ham's F12 media, with 10% FBS, penicillin and 
streptomycin) is added. The flasks are then incubated at 37 degree C for 

approximately one week. 

At this time, fresh media is added and subsequently changed every several 
days.Afteranadditionaltwoweeksmculture,amonolayeroffibrob^ 

The monolayer is trypsinized and scaled into larger flasks. 

pMV-7 (Kirschmeier, P.T. et «L. DNA 7:219-25 (1988)), flanked by the 
long terminal repeats of the Moloney murine sarcoma virus, is digested with 
EcoRl andHindlll and subsequently treated with calf intestinal phosphatase. The 
linear vector is fractionated on agarose gel and purified, using glass beads. 

The cDNA encoding METH1 or METH2 can be amplified using PCR 
primers which correspond to the 5' and 3' end sequences respectively as set forth 
in Example 5. Preferably, the 5' primer contains an EcoRI site and the 3' pnmer 
includes a Hindffl site. Equal quantities of the Moloney murine sarcoma vkus 
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linear backbone and the amplified EcoRI and Hindlll fragment are added together, 
in the presence of T4 DNA ligase. The resulting mixture is maintained under 
conditions appropriate for ligation of the two fragments. The ligation mixture is 
then used to transform bacteria HB101, which are then plated onto agar 
containing kanamycin for the purpose of confirming that the vector contains 
properly inserted METH1 or METH2. 

The amphotropic pA3 1 7 or GP+aml 2 packaging cells are grown in tissue 
cultureto confluent density in Dulbecco's Modified Eagles Medium (DMEM) with 
10% calf serum (CS), penicillin and streptomycin. The MSV vector containing 
the METH1 or METH2 gene is then added to the media and the packaging cells 
transduced with the vector. The packaging cells now produce infectious viral 
particles containing the METH1 or METH2 gene(the packaging cells are now 
referred to as producer cells). 

Fresh media is added to the transduced producer cells, and subsequently, 
the media is harvested from a 1 0 cm plate of confluent producer cells. The spent 
media, containing the infectious viral particles, is filtered through a millipore filter 
to remove detached producer cells and this media is then used to infect fibroblast 
cells. Media is removed from a sub-confluent plate of fibroblasts and quickly 
replaced with the media from the producer cells. This media is removed and 
replaced with fresh media. If the titer of virus is high, then virtually all fibroblasts 
will be infected and no selection is required. If the titer is very low, then it is 
necessary to use a retroviral vector that has a selectable marker, such as neo or 
his. Once the fibroblasts have been efficiently infected, the fibroblasts are analyzed 
to determine whether METH1 or METH2 protein is produced. 

The engineered fibroblasts are then transplanted onto the host, either alone 
or after having been grown to confluence on cytodex 3 microcarrier beads. 
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ExampleSO: Method of Treatment Using Gene Therapy -In Vivo 

Another aspect of the present invention is using in vivo gene therapy 
methods to treat disorders, diseases and conditions. The gene therapy method 
relatesto the introduction of naked nucleic acid (DNA, RNA, and antisenseDNA 
or RNA) METH1 or METH2 sequences into an animal to increase or decrease the 
expression of the METH1 or METH2 polypeptide. The METH1 or METH2 
polynucleotide may be operatively linked to a promoter or any other genetic 
elements necessary for the expression of the METH1 or METH2 polypeptide by 
the target tissue Such gene therapy and delivery techniques and methods are 
knownintheart, see,for example, WO90/1 1092, WG98/1 1779; U.S. PatentNO. 
5693622,5705151,5580859;TabataH.era/.(1997)Cor^va SC . J Re5.55r3;:470- 

479, Chao, J et al. (1997) Pharmacol. Res. 35(6) 5X1-522, Wolff (1997) 
Neitromuscul. Disord 7(5)314-31*, Schwartz, B. et al. (1996) Gene Ther. 
3^:405-411, Tsurumi Y. et al. (1996) Circulation 94(12) 32*1-3290 

(incorporated herein by reference). 

The METH1 or METH2 polynucleotide constructs may be delivered by 
any method that delivers injectable materials to the cells of an animal, such as, 
injection into the interstitial space of tissues (heart, muscle, skin, lung, liver, 
intestine and the like). The METH1 or METH2 polynucleotide constructs can be 
delivered in a pharmaceutically acceptable liquid or aqueous carrier. 

The term "naked" polynucleotide, DNA or RNA, refers to sequences that 
are free from any delivery vehicle that acts to assist, promote, or facilitate entry 
into the cell, including viral sequences, viral particles, liposome formulations, 
lipofectinor precipitating agents and the like. However, the METH1 orMETffi 
polynucleotides may also be delivered in liposome formulations (such as those 
taught inFelgnerPL. etal. (\995) Ann. NY Acad. Sci. 772:126-139 and Abdallah 
B. et al. (1995) Biol. Cell 85(1):\-1) which can be prepared by methods well 
known to those skilled in the art. 
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The METH1 or METH2 polynucleotide vector constructs used in the 
gene therapy method are preferably constructs that will not integrate into the host 
genome nor will they contain sequences that allow for replication. Any strong 
promoter known to those skilled in the art can be used for driving the expression 
of DNA. Unlike other gene therapies techniques, one major advantage of 
introducing naked nucleic acid sequences into target cells is the transitory nature 
of the polynucleotide synthesis" in the cells. Studies have shown that non- 
replicating DNA sequences can be introduced into cells to provide production of 
the desired polypeptide for periods of up to six months. 

The METH1 or METH2 polynucleotide construct can be delivered to the 
interstitial space of tissues within the an animal, including of muscle, skin, brain, 
lung, liver, spleen, bone marrow, thymus, heart, lymph, blood, bone, cartilage, 
pancreas, kidney, gall bladder, stomach, intestine, testis, ovary, uterus, rectum, 
nervous system, eye, gland, and connective tissue. Interstitial space of the tissues 
1 5 comprises the intercellular fluid, mucopolysaccharide matrix among the reticular 

fibers of organ tissues, elastic fibers in the walls of vessels or chambers, collagen 
fibers of fibrous tissues, or that same matrix within connective tissue ensheathing 
muscle cells or in the lacunae of bone. It is similarly the space occupied by the 
plasma of the circulation and the lymph fluid of the lymphatic channels. Delivery 
20 to the interstitial space of muscle tissue is preferred for the reasons discussed 

below. They may be conveniently delivered by injection into the tissues 
comprising these cells. They are preferably delivered to and expressed in 
persistent, non-dividing cells which are differentiated, although delivery and 
expression may be achieved in non-differentiated or less completely differentiated 
25 cells, such as, for example, stem cells of blood or skin fibroblasts. In vivo muscle 

cells are particularly competent in their ability to take up and express 
polynucleotides. 

For the naked METH1 or METH2 polynucleotide injection, an effective 
dosage amount of DNA or RNA will be in the range of from about 0.05 g/kgbody 
30 weight to about 50 mg/kg body weight. Preferably the dosage will be from about 
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0.005 mg/kg to about 20 mg/kg and more preferably from about 0.05 mg/kg to 
about 5 mg/kg. Of course, as the artisan of ordinary skill will appreciate, this 
dosage will vary according to the tissue site of injection. The appropriate and 
effective dosage of nucleic acid sequence can readily be determined by those of 
5 ordinary skill in the art and may depend on the condition being treated and the 

route of administration. The preferred route of administration is by the parenteral 
route of injection into the interstitial space of tissues. However, other parenteral 
routes may also be used, such as, inhalation of an aerosol formulation particularly 
for delivery to lungs or bronchial tissues, throat or mucous membranes of the 

10 nose. In addition, naked METH1 or METH2 polynucleotide constructs can be 

delivered to arteries during angioplasty by the catheter used in the procedure. 

The dose response effects of injected METH1 or METH2 polynucleotide 
in muscle in vivo is determined as follows. Suitable METH 1 or METH2 template 
DNA for production of mRNA coding for METH1 or METH2 polypeptide is 

15 prepared in accordance with a standard recombinant DNA methodology. The 

template DNA, which may be either circular or linear, is either used as naked 
DNA or complexed with liposomes. The quadriceps muscles of mice are then 
injected with various amounts of the template DNA. 

Five to six week old female and male Balb/C mice are anesthetized by 

20 intraperitoneal injection with 0.3 ml of 2.5% Avertin. A 1 .5 cm incision is made 

on the anterior thigh, and the quadriceps muscle is directly visualized. The 
METH1 or METH2 template DNA is injected in 0. 1 ml of carrier in a 1 cc syringe 
through a 27 gauge needle over one minute, approximately 0.5 cm from the distal 
insertion site of the muscle into the knee and about 0.2 cm deep. A suture is 

25 placed over the injection site for future localization, and the skin is closed with 

stainless steel clips. 

After an appropriate incubation time (e.g., 7 days) muscle extracts are 
prepared by excising the entire quadriceps. Every fifth 1 5 urn cross-section of the 
individual quadriceps muscles is histochemically stained for METH1 or METH2 

30 protein expression. A time course for METH 1 or METH2 protein expression may 
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be done in a similar fashion except that quadriceps from different mice are 
harvested at different times. Persistence of METH1 or METH2 DNA in muscle 
following injection may be determined by Southern blot analysis after preparing 
total cellular DNA and HIRT supernatants from injected and control mice. The 
results of the above experimentation in mice can be use to extrapolate proper 
dosages and other treatment parameters in humans and other animals using 
METH1 or METH2 naked DNA. 

It will be clear that the invention may be practiced otherwise than as 
particularly described in the foregoing description and examples. 

Numerous modifications and variations of the present invention are 
possible in light of the above teachings and, therefore, are within the scope of the 
appended claims. 

The entire disclosure of all publications (including patents, patent 
applications, journal articles, laboratory manuals, books, or other documents) 
cited herein are hereby incorporated by reference. 
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Applicant's or agent's file 


International application No: TBA 


reference number: 1 488. 1 07PC02 





INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 
(PCT Rule 13 to) 



A. The indications made below relate to the microorganism referred to in the description on page 22. lines 16-17 . 



a IDENTIFICATION OF DEPOSIT Fimhcr deposits ^ identified on an additional sheet B 

Name of depository institution 

American Type Culture Collection 



Address of depository institution (including postal code and country) 

10801 University Boulevard formerly at: 12301 Parklawn Drive 

Manassas, Virginia 201 10-2209 Rockville, Maryland 20852 

United States of America United States of America 



Date of deposit 




Accession Number 


15 January 1998 




209581 



C. ADDITIONAL INDICATIONS f leave blank if not applicable) information is continued on an additional sheet □ 



DNA plasmid HOUCQ17 



D. DESIGN ATED STATES FOR WHICH INDICATIONS ARE MADE fif the indications are not for all designated Stalest 



E. SEPARATE FURNISHING OF INDICATIONS a,™ Mo* ,t m* apptraMo 

The indications listed below will be submitted to the international Bureau later (specify the general nature of the indications, e.g.. 
"Accession ft umber of Deposit") 



1 ror receiving Olhce use only 




□ This sheet was received with the international application 


□ This sheet was received by the Internal tonal Bureau on: 


Authorized officer 


Authorized officer 



h>rm PCT/R0/134 (JuK !W> 



107expsoljp.pci 
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Appiicant's or agent's file 
reference number: I488.107PC02 



International application No: TBA 



INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCT Rule 136/5) 



A. The indications made below relate to the microorganism referred to in the description on page 12. lines 25-26 . 



B. IDENTIFICATION OF DEPOSIT 



Further deposits are identified on an additional sheet □ 



Name of depository institution 
American Type Culture Collection 



Address of depository institution (including postal code and country) 



10801 University Boulevard 
Manassas. Virginia 201 10-2209 
United States of America 



formerly at: 



12301 Parklawn Drive 
Rockville, Maryland 20852 
United States of America 



Date of deposit 


Accession Number 


15 January 1998 


209582 



C. ADDITIONAL INDICATIONS (leave blank if not applicable} jy{ iS information is continued on an additional sheet □ 



DNA plasmid HCE4D69 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all dcvenaied States! 



E. SEPARATE FURNISHING OF INDICATIONS tkw u** it m* apphn*k> 



The indications listed below will be submitted to the international Bureau later (specify the general nature of the indications, e.g. 
"Accession Sumber of Deposit") 



For receiving Office use only 



□ This sheet was received with the international application 



Authorized officer 



For International Bureau use onl\ 



O This sheet was received bv the Internal ional Bureau on: 



Authorized officer 



Form PCITRO/IS-I Uut> W2) 



I07expsoljp.2pci 
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What Is Claimed Is: 

1. An isolated nucleic acid molecule comprising a polynucleotide 
selected from the group consisting of: 

(a) a polynucleotide encoding a polypeptide comprising amino 
acids 1 to 950 in SEQ ID NO:2; 

(b) a polynucleotide encoding a polypeptide comprising amino 
acids 2 to 950 in SEQ ID NO:2; 

(c) a polynucleotide encoding a polypeptide comprising amino 
acids 29 to 950 in SEQ ID NO:2; 

(d) a polynucleotide encoding a polypeptide comprising amino 
acids 30 to 950 in SEQ ID NO:2; 

(e) a polynucleotide comprising a nucleotide sequence 
encoding the METH 1 polypeptide having the amino acid sequence encoded by the 
cDNA clone contained in ATCC Deposit No. 209581; 

(f) a polynucleotide comprising a nucleotide sequence 
encoding the mature METH1 polypeptide having the amino acid sequence 
encoded by the cDNA clone contained in ATCC Deposit No. 209581 ; 

(g) a polynucleotide encoding a polypeptide comprising amino 
acids 1 to 890 in SEQ ID NO:4; 

(h) a polynucleotide encoding a polypeptide comprising amino 
acids 2 to 890 in SEQ ID NO:4; 

(i) a polynucleotide encoding a polypeptide comprising amino 
acids 24 to 890 in SEQ ID NO:4; 

(j) a polynucleotide encoding a polypeptide comprising amino 
acids 1 12 to 890 in SEQ ID NO:4; 

(k) a polynucleotide comprising a nucleotide sequence 
encoding the METH2 polypeptide having the amino acid sequence encoded by the 
cDNA clone contained in ATCC Deposit No. 209582; 
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(1) a polynucleotide comprising a nucleotide sequence 
encoding the mature METH2 polypeptide having the amino acid sequence 
encoded by the cDNA clone contained in ATCC Deposit No. 209582; 

(m) a polynucleotide variant created by altering a 
polynucleotide of (a), (b), (c), (d), (e), (f), (g), (h), (i), 0), (k), or (1), wherein: 

(i) said altering includes a nucleotide insertion, 
deletion, or substitution, or any combination thereof; and 

(ii) the number of alterations is equal to or less than 5% 
of the total number of nucleotides present in the unaltered polynucleotide; 

(n) a polynucleotide encoding amino acids 23 5 to 459 in SEQ 



IDNO:2; 
IDNO:2; 
ID NO:2; 
ID NO:2; 
IDNO:2; 
E)NO:2; 
IDNO:2; 
ID NO:4; 
IDNO:4; 
IDNO:4; 



(o) a polynucleotide encoding amino acids 460 to 544 in SEQ 

(p) a polynucleotide encoding amino acids 545 to 598 in SEQ 

(q) a polynucleotide encoding amino acids 841 to 894 in SEQ 

(r) a polynucleotide encoding amino acids 895 to 934 in SEQ 

(s) a polynucleotide encoding amino acids 536to613in SEQ 

(t) a polynucleotide encoding amino acids 549 to 563 in SEQ 

(u) a polynucleotide encoding amino acids 2 1 4 to 43 9 in SEQ 

(v) a polynucleotide encoding amino acids 440 to 529 in SEQ 

(w) a polynucleotide encoding amino acids 530 to 583 in SEQ 
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(x) a polynucleotide encoding amino acids 837 to 890 in SEQ 

IDNO:4;* 

(y) a polynucleotide encoding amino acids 280 to 606 in SEQ 

IDNO:4); 

(z) a polynucleotide encoding amino acids 529 to 548 in SEQ 

ID NO:4; and 

(aa) a nucleotide sequence complementary to any of the 
nucleotide sequences in (a), (b), (c), (d), (e), (f), (g), (h), (i), (j), (k), (1), (m), (n), 
(o), (p), (q), (r), (s), (t), (u), (v), (w), (x), (y), or (z). 



10 2. An isolated nucleic acid molecule comprising a polynucleotide 

which encodes the amino acid sequence of an epitope-bearing portion of the 
METH1 polypeptide of SEQ ID NO:2 or the METH2 polypeptide of SEQ ID 
NO:4. 



3. An isolated nucleic acid molecule, comprising a polynucleotide 
1 5 selected from the group consisting of: 

(a) 50 contiguous nucleotides of the coding region of SEQ ID 
NO: 1, provided that said nucleotide sequence is not any one of SEQ ID NOs: 14- 
41, or any subfragment thereof; and 

(b) a nucleotide sequence complementary to the nucleotide 
20 sequence in (a). 

4. An isolated nucleic acid molecule, comprising a polynucleotide 
selected from the group consisting of: 

(a) 50 contiguous nucleotides of the coding region of SEQ ID 
NO:3, provided that said nucleotide sequence is not SEQ ID NOs: 19-22, 24, 42- 

25 77, or any subfragment thereof; and 

(b) a nucleotide sequence complementary to the nucleotide 
sequence in (a). 
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5. A method for making a recombinant vector comprising inserting 
an isolated nucleic acid molecule of claim 1 into a vector in operable linkage to a 
promoter. 

6. A recombinant vector produced by the method of claim 5. 

7. A method of making a recombinant host cell comprising 
introducing the recombinant vector of claim 6 into a host cell. 

8. A recombinant host cell produced by the method of claim 7. 

9. A recombinant method for producing a METH1 or METH2 
polypeptide, comprising culturing the recombinant host cell of claim 8 under 
conditions such that said polypeptide is expressed and recovering said polypeptide. 

10. An isolated polypeptide comprising an amino acid sequence 
selected from the group consisting of: 

(a) amino acids 1 to 950 in SEQ ID NO:2; 

(b) amino acids 2 to 950 in SEQ ID NO:2; 

(c) amino acids 29 to 950 in SEQ ID NO:2; 

(d) amino acids 30 to 950 in SEQ ID NO:2; 

(d) theaminoacidsequenceoftheMETHl polypeptide having 
the amino acid sequence encoded by the cDNA clone contained in ATCC Deposit 
No. 209581; 

(e) the amino acid sequence of the mature METH1 polypeptide 
having the amino acid sequence encoded by the cDNA clone contained in ATCC 
Deposit No. 209581; 

(f) amino acids 1 to 890 in SEQ ID NO:4; 

(g) amino acids 2 to 890 in SEQ ID NO:4; 

(h) amino acids 24 to 890 in SEQ ID NO:4; 
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15 



20 
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(i) amino acids 1 1 2 to 890 in SEQ ID NO:4; 

(j) an amino acid sequence of the METH2 polypeptide having 
the amino acid sequence encoded by the METH2 cDNA clone contained in ATCC 
Deposit No. 209582; 

(k) an amino acid sequence of the mature METH2 polypeptide 
having the amino acid sequence encoded by the METH2 cDNA clone contained 
in ATCC Deposit No. 209582; 

(1) the amino acid sequence of a polypeptide variant created 
by altering a polypeptide of (a), (b), (c), (d), (e), (0, (g), (h), (i), 0), or (k), 
wherein: 

(i) said altering includes an amino acid insertion, 
deletion, or substitution, or any combination thereof; and 

(ii) the number of alterations is equal to or less than 5% 
of the total number of amino acids present in the unaltered amino acid sequence; 

(m) amino acids 235 to 459 in SEQ ID NO:2; 

(n) amino acids 460 to 544 in SEQ ID NO:2; 

(o) amino acids 545 to 598 in SEQ ID NO:2; 

(p) amino acids 84 1 to 894 in SEQ ID NO:2; 

(q) amino acids 895 to 934 in SEQ ID NO:2; 

(r) amino acids 536 to 613 in SEQ ID NO:2; 

(s) amino acids 549 to 563 in SEQ ID NO:2; 

(t) amino acids 214 to 439 in SEQ ID NO:4; 

(u) amino acids 440 to 529 in SEQ ID NO:4; 

(v) amino acids 530 to 583 in SEQ ID NO:4; 

(w) amino acids 837 to 890 in SEQ ID NO:4; 

(x) amino acids 280 to 606 in SEQ ID NO:4; 

(y) amino acids 529 to 548 in SEQ ID NO:4; 

(z) the amino acid sequence of an epitope-bearing portion of 
any one of the polypeptides of (a), (b), (c), (d), (e), (f), (g), (h), (i), 0), 00, 0), 
(m), (n), (o), (p), (q), (r), (s), (t), (u), (v), (w), (x), or (y). 
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11. The isolated polypeptide of claim 10, which is produced in a 
recombinant host cell. 

12. The isolated polypeptide of claim 1 1, wherein said recombinant 
host cell is mammalian. 

13. An isolated nucleic acid molecule comprising a polynucleotide 
encoding a METH1 or METH2 polypeptide wherein, except for one to fifty 
conservative amino acid substitutions, said polypeptide has a sequence selected 
from the group consisting of: 

(a) amino acids from about 1 to about 950 in SEQ ID NO.2; 

(b) amino acids from about 2 to about 950 in SEQ ID NO:2; 

(c) amino acids from about 29 to about 950 in SEQ ID NO:2; 

(d) amino acids from about 30 to about 950 in SEQ ID NO:2; 

(e) the amino acid sequence of the METH1 polypeptide as 
encoded by the cDNA clone contained in ATCC Deposit No. 209581 ; 

(f) the amino acid sequence of the mature METH 1 polypeptide 
as encoded by the cDNA clone contained in ATCC Deposit No. 209581; 

(g) amino acids from about 1 to about 890 in SEQ ID NO:4; 

(h) amino acids from about 2 to about 890 in SEQ ID NO:4; 

(i) amino acids from about 24 to 890 in SEQ ID NO:4; 

Q amino acids from about 1 1 2 to about 890 in SEQ ID NO:4; 

(k) the amino acid sequence of the METH2 polypeptide as 
encoded by the cDNA clone contained in ATCC Deposit No. 209582; and 

(1) the amino acid sequence of the mature METH2 polypeptide 
as encoded by the cDNA clone contained in ATCC Deposit No. 209582. 

14. An isolated polypeptide wherein, except for one to fifty 
conservative amino acid substitutions, said polypeptide has a sequence selected 
from the group consisting of: 
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(a) 


amino acids from about 1 to about 950 in SEQ ID NO:2; 


00 


amino acids from about 2 to about 950 in SEQ ID NO:2; 


(c) 


amino acids from about 29 to about 950 in SEQ ID NO:2; 


(<0 


amino acids from about 30 to about 950 in SEQ ID NO:2; 


(e) 


the amino acid sequence of the METH 1 polypeptide having 


the amino acid sequence encoded by the cDNA clone contained in ATCC Deposit 


No. 209581; 




(f) 


the amino acid sequence of the mature METH 1 polypeptide 


having the amino acid sequence encoded by the cDNA clone contained in ATCC 


Deposit No. 209581; 




(g) 


amino acids from about 1 to about 890 in SEQ ID NO:4; 


CO 


amino acids from about 2 to about 890 in SEQ ID NO:4; 


0) 


amino acids from about 24 to about 890 in SEQ ID NO:4; 


0) 


amino acids from about 1 12 to about 890 in SEQ ID NO:4; 


(k) 


the amino acid sequence of the METH2 polypeptide having 



the amino acid sequence encoded by the cDNA clone contained in ATCC Deposit 
No. 209582; 



(1) the amino acid sequence of the mature METH2 polypeptide 
having the amino acid sequence encoded by the cDNA clone contained in ATCC 
Deposit No. 209582; and 

(m) the amino acid sequence of an epitope-bearing portion of 
any one of the polypeptides of (a), (b), (c), (d), (e), (f), (g), (h), (i), Q), (k), or (1). 

15. An isolated nucleic acid molecule comprising a polynucleotide at 
least 95% identical to a polynucleotide selected from the group consisting of: 

(a) a polynucleotide encoding a polypeptide comprising amino 
acids 1 to 950 in SEQ ID NO:2; 

(b) a polynucleotide encoding a polypeptide comprising amino 
acids 2 to 950 in SEQ ID NO:2; 
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(c) a polynucleotide encoding a polypeptide comprising amino 
acids 29 to 950 in SEQ ID NO:2; 

(d) a polynucleotide encoding a polypeptide comprising amino 
acids 30 to 950 in SEQ ID NO:2; 

5 (e) a polynucleotide comprising a nucleotide sequence 

encoding the METH1 polypeptide having the amino acid sequence encoded by the 
cDNA clone contained in ATCC Deposit No. 20958 1 ; 

(f) a polynucleotide comprising a nucleotide sequence 
encoding the mature METH1 polypeptide having the amino acid sequence 

1 0 encoded by the cDNA clone contained in ATCC Deposit No. 20958 1 ; 

(g) a polynucleotide encoding a polypeptide comprising amino 
acids 1 to 890 in SEQ ID NO:4; 

(h) a polynucleotide encoding a polypeptide comprising amino 
acids 2 to 890 in SEQ ED NO:4; 

15 (i) a polynucleotide encoding a polypeptide comprising amino 

acids 24 to 890 in SEQ ID NO:4; 

(j) a polynucleotide encoding a polypeptide comprising amino 
adds 1 12 to 890 in SEQ ID NO:4; 

(k) a polynucleotide comprising a nucleotide sequence 
20 encoding the METH2 polypeptide having the amino acid sequence encoded by the 

cDNA clone contained in ATCC Deposit No. 209582; 

(1) a polynucleotide comprising a nucleotide sequence 
encoding the mature METH2 polypeptide having the amino acid sequence 
encoded by the cDNA clone contained in ATCC Deposit No. 209582; and 
25 (m) a nucleotide sequence complementary to any of the 

nucleotide sequences in (a), (b), (c), (d), (e), (f), (g), (h), (i), (j), 00, or (1), 
wherein 

said % identity is calculated using the FASTDB computer program, with 
the parameters: Matrix=Unitary, k-tuple=4, Mismatch Penalty=l, Joining 
30 Penalty=30, Randomization Group Length=0, Cutoff Score=l, Gap Penalty=5, 



WO 99/37660 



PCT/US99/01313 



-188- 

Gap Size Penalty=0.05, Window Size=500 or the length of the subject nucleotide 
sequence, whichever is shorter. 



16. An isolated polypeptide comprising a polypeptide having 95% 
identity to a polypeptide having an amino acid sequence selected from the group 
consisting of: 



(a) 


amino acids from about 1 to about 950 in SEQ ID NO:2; 


0>) 


amino acids from about 2 to about 950 in SEQ ID NO:2; 


(c) 


amino acids from about 29 to about 950 in SEQ ID NO:2; 


(d) 


amino acids from about 30 to about 950 in SEQ ID NO:2; 


(e) 


the amino acid sequence of the METH 1 polypeptide having 


the amino acid sequence encoded by the cDNA clone contained in ATCC Deposit 


No. 209581; 




(f) 


the amino acid sequenceofthe mature METH1 polypeptide 


having the amino acid sequence encoded by the cDNA clone contained in ATCC 


Deposit No. 209581; 




(g) 


amino acids from about 1 to about 890 in SEQ ID NO:4; 


00 


amino acids from about 2 to about 890 in SEQ ID NO:4; 


CO 


amino acids from about 24 to about 890 in SEQ ID NO:4; 


0) 


amino acids from about 1 12 to about 890 in SEQ ID NO:4; 


00 


the amino acid sequence of theMETH2 polypeptide having 



the amino acid sequence encoded by the cDN A clone contained in ATCC Deposit 
No. 209582; and 

(1) the amino acid sequence of the mature METH2 polypeptide 
having the amino acid sequence encoded by the cDNA clone contained in ATCC 
Deposit No. 209582; 
wherein 

said % identity is calculated using the FASTDB computer program, with 
the parameters: Matrix=PAM 0, k-tuple=2, Mismatch Penalty=l, Joining 
Penalty=20, Randomization Group Length=0, Cutoff Score=l, Gap Penal ty=5, 
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Gap Size Penalty=0.05, Window Size=500 or the length of the subject amino acid 
sequence, whichever is shorter. 

17. A method for inhibiting angiogenesis in an individual, comprising 
administering an effective amount of a polypeptide of claim 1 0 to said individual. 

18. A polypeptide comprising the amino acid sequence m-n of SEQ ID 
NO:2, wherein m is an integer of 1 to 950, and wherein n is an integer of 10 to 
950. 

1 9. A polypeptide comprising the amino acid sequence m-n of SEQ ID 
NO:4, wherein m is an integer of 1 to 890, and wherein n is an integer of 10 to 
890. 
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AT(XX£M(XXX£AGCG(XXTCCG(^ 

MGNAERAPGSRSFGPVPTLLLLAAAL 

CTGGCCGTGTO^ACG{XT(X(XXXX(XCT(XGAGGAGGACGAGGAGCTAGTGG 
LAVSDALGRPSEEDEELVVPELERAP 

GGACACGGGACCACGCGCCTCCGCCTGCACGCCTTTGAC^ 

GHGTTRLRLHAFDQQLDLELRPDSSF 

TTGGCGCCCGGCTTCACGCTCCAGAACGTGGGGCGCAAATCCGGGTCCGAGACGC(X5CTT(^ 
LAPGFTLQNVGRKSGSETPLPETDLA 

CACTGCTTCTACTCCGGCAC&TGMTGGCGATCCCAGCT^ 

HCFYSGTVNGDPSSAAALSLCEGVRG 
GCCTTCTAO:T(£T(^A«mATTT(mtf 

AFYLLGEAYF IQPLPAASERLATAAP 
GGGGAGAAGCCGCCGGCACMCTACAGTTCCACCTGCTGCGGCG^ 

6EKPPAPL0FHL LRRHRQGDVGGTCG 
GTGGTGGACGACGAGCCGCGGCCGACTGGGAAAGCGGAGAGCGA&^ 

VVDOEPRPTGKAETEDEDEGTEGEDE 

axmcAGT&TrcaxKAa^^ 

GPQWSPQOPALQGVGQPTG TGS I RKK 

CGATTTGTGTCCAGTCACGGCTATGTGGAAACCATGCTTGTGG(^ACCAGTO 
RFVSSHRYVETMLVADQSMAEFHGSG 

CTAAAGCATTACCTTCTCAGGTTGTTTTGGGTGG()AGCCAGATO 

LKHYLLTLFSVAARLYKHPSI RNSVS 
CTGGTGGTGGTGAAGATCTTGGTCATO^ATGMCAGAAGGGG 

LVVVKILVIHOEQKGPEVTSNAALTL 

CGGAACTTTTGCAACTGGCAGAAGCAGCACAACCCACCCAGTGACCGGGATGC& 
RNFCNWQKQHNPPSDRDAEHYOTAI L 

TTCA(X%ACAGGACTTGTGTGGGTCC(IAGA()ATGTGATACTCTTGGGATGGCTGATGTTGGAACTGTGTGTGA^ 
ETRQDLCGSQTCOTLGMAOVGTVCDP 

AGCAGAAGCTGCTCCGTCATAGAAGATGAT(XnTTACA/^ 

SRSCSVIEDDGLQAAFTTAHELGHVF 

AACATGC()ACATGATGATGCAAAGCAGTGTGCCAG(X)TTMTGGTGTGMCCAGGATTCCCACATGATGGCGTCMTG 
NMPHDOAKOCASLNGVNQDSHMMASM 



FIG.1A 
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CTTTCCAACCTGGA(XIACAGCWGCCTTGGTCTCCTTGCAGTG(X^TACATGATTACATCATTTCTGGATAATGGTCAT 
LSNLDHSQPWSPCSAYMI TSFLDNGH 

GG(X5MTGTTTGATGGACAAGCCTCAGMT(XX^TACAGCT(XX)AGG(X5ATCTCCCT(XJC^ 
GECLMOKPONP IQLPGDLPGTSYDAN 

CGGCAGTGCCAGTTTACATTTGGGGAGGACTCCAMCACTGCCCTGATGCAGCCAGCACATGTAGCACCTTGTGGTGT 
RQCQFTFGEOSKHCPDAASTCSTLWC 

ACCGGCACCTCTGGTGGGGTGCTGGTGTGTCAAACCAMCACTTCCCGTGGGCGGATGGCACCAGCTGTGGAGAAG^ 
TGTSGGVLVCQTKHFPWADGTSCGEG 

AMTGGTGTATCMCGGCAAGTGTGTGMCAAAACCGA()AGAAAGCATTTTC^ 
KWC 1 NGKCVNKTDRKHFDTPFHGSWG 

ATGTGGGGGCCTTGGGGAGACTGTTCGAGAACGTGCGGTGGAGGAGTCCAGTA(}ACGATGAGGGMTG 
MWGPWGDCSRTCGGGVQYTMRECDNP 

GTCCCAAAGAATGGAGGGAAGTACTGTGAAGGCAAACGAGTGCGCTACAGATCCTGTMCCTTGAGGACTGTCCAGAC 
VPKNGGKYCEGKR V R YRSCNLEDCPD 

MTMT(X5AAAAACCTTTAGAGAGGMCMTGTGAAG(^CA{^CGAGTTTTCAAAAGCTTCCTTT(^GAGTGGGCC^ 
NNGKTFREEQCEAHNEFSKASFGSGP 

GCGGTGGMTGGATTC(XM>TACGCTGGCGTCTCACCAAAGGACAGGTGCAAGCTCATCrc^ 
AVEWIPKYAGVSPKDRCKL I COAKG I 

GGCTACTTCTTCGTTTTGCAGCCCAAGGTTGTAGATGGT^ 

GYFFVLQPKVVDGTPCSPDSTSVCVQ 

GGACAGTGTGTAAAAGCTGGTTGTGATCGCATCATAGACTCCAAAAAGAAGTTTGATAMTGT(^ 
GQCVKAGCOR I IOSKKKFOKCGVCGG 

MTGGATCTACTTGTAAAAAMTATCAGGATCAGTTACTAGTGCAAAACCTGGATATCATGATATCATCACAATTCCA 
NGSTCKKISGSVTSAKPGYHDI I TIP 

ACTGGAGCCA(XM^TGGAAGTGAMCAG(X£AAGCAGAGGGG^ 

TGATNIEVKQRNQRGSRNNGSFLAIK 

GCTGCTGATGGCACATATATTCTTAATGGTGACTAC^TTTGTCCACCTTAGAGCAAGACATTATGTACAAA^ 
AADGTYILNGDYTLSTLEQDIMYKGV 

GTCTTGAGGTACAGGX&TCCTCTGCGGCATTGGAAAGM 

VLRYSGSSAALERIRSFSPLKEPLT I 
CAGGTTCTTACTGTGGGCMTGCCCTTOGACCTAAMn^ 

QVLTVGNALRPKIKYTYFVKKKKESF 



FIG.1B 
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AATGCTATCC()CACTTTTTCAGCATGGGTCATTGAAGAGTGGGGCGA^^ 

NAIPTFSAWVIEEWGECSKSCELGWQ 

AGAAGACTGGTAGMTGCCGACACATTMTGGACAGCCTGCTTCCGAGTGTG^ 
RRLVECRDINGQPASECAKEVKPAST 

AGACCTTGTGCAGACCATCCCTG(XXCCAGTGGCAGCTGG^ 

RPCADHPCPQWQLGEWSSCSKTCGKG 

TACAAAAAAAGAAGCTTGAAGTGTCTGTCCCATGATGGAGGGGTGTTATCO 
YKKRSIKCLSHDGGVLSHESCDPLK'K 

CCTAAACATTTCATAGACTTTTG(^CMTGG(^MTGCAGTTAAGTGGTTTAAGTGGTGTTAGCTTTGAGGC 
PKHF IDFCTMAECS 

AAAGTGAGGAAGGGCTGGTGCAGGGAAAGCAAGAAGGCTGGAGGGATC^ 

TATCAGTAAGGTGGGATTATGGGGGTAGATAGAAAAGGAGTTGMTCATCAGAGTAAACTGCCAGTTGC 

AGGATAGTTAGTGAGGATTATTAACCTCTGAG(^TGATATAG^ 

TTTTGTTACATCTATTACAAGTTTAGAAAAAACAAAGCMTTGTCAAAA 

GCGGC(XJCTCTAGAGGATCCCT(X>AGGGGCCCAAGCTTACGGGTGCATGNTGTCATNAGTCTN 



FIG. 1 C 



SUBSTITUTE SHEET (RULE 26) 



WO 99/37660 



PCT/US99/01313 



4/19 

ATGTTaXXXmXXGGCGO^^ 

MFPAPAAPRWLPFLILLLLLLLPLAR 

GGCGCCCCGGCCCGG^^ 

GAPARPAAGGQASELVVPTRLPG SAG 

GAGCT(X^T(XIACCTGTC{XXX)TTGGGCAAGGGCTTCGTGTTGCGCCTGGCGCC^ 
ELALHLSAFGKGFVLRIAPDDSFLAP 

GAGTT(M5ATay^GCatT(Xm»:T(XGGaxmX 

EFKIERLGGSGRATGGERGLRGCFFS 
GGtMXnGMTGGGGAGCCGGAGTCGCTGGCGGCGGTCA^ 

GTVNGEPESLAAVSLCRGLSGSFLLD 
GGttAGGAGTTCACCAT(XAGCCGCAGO^^ 

GEEFT IQPQGAGGSLAOPHRLQRWGP 
GCCGGAGCCGGCCCCCTCCO^AGGACCCGAGTGGGAGGTGGAGA^^ 

AGARPLPRGPEWEVETGEGQRQERGD 

CACCAGGAGGACAGCGAGGAGGAGAGCCAAGAAGAGGAGGCAGAAGGCGCTAG 
HQEDSEEESQEEEAEGASEPPPPLGA 

ACGAGTAGGAC(MXX»TTTGTGTCTGAG(XX^ 

TS-RTKRFVSEARFVE. TLLVADASMAA 
TTCTACG(XX^/^TG(^AAGCACATCCTGACGTTM^ 

FYGADLQNHILTLMSVAAR IYKHPSI 

AAGMTTCCATCAACCTGATGGTGGTAAAAGTGCTGATCGTAGAAGATGAAAAATGGGG(XXAGAGGTG 
KNS1NLMVVKVL IVEOEKWGPEVSDN 

GGGGGGCTTA(MT(X£TACTTCT«^ 

GGLTLRNFCNWQRRF NO PSORHPEHY 

GACAGGG(X)ATCCTGCTCACCAGACAGAACTTCTGTGGGCAGGAGGGGCTGTGTGA 
DTAILLTRQNFCGQEGLCOTLGVADI 

GGGACCATTTGTGACCCtM^AAAGCTGCTCCGTGATGGAGGAT^^ 

GT ICDPNKSCSVIEDEGLQAAHTLAH 
GAACTAGGGCACGTCCTCAGWTGC(X)CAQ5ACGACTCCAAGCCCTGCAC 

ELGHVLSMPHDDSKPCTRLF. GPMGKH 
CAGGTGAT(XXACGGCTGTTCGT(X)ACCTGAACCAGACGCTGCCCT(X>Ta 

HVMAPLFVHLNQTLPWSPCSAMYLTE 

CTTCTGGAG(XXjGGGCACGGAGACTGTCTCCTGGATGCCCCTGGTGGGGCC^ 
LLOGGHGOCLLDAPGAALPLPTGLPG 

(XXIATGG(XXTGTAGCAGCTGGACCA(^TGCAGGCAGATCTTTGGGCCGGATTTC^ 
RMALYQLDOQCRQI FGPDFRHCPNTS 

FIG.2A 
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GCTCmCGTCTGCGCCCAGCTTTGCT 

AQDVCAQLWCHTDGAEPLCHTKNGSL 
(XCT(XX£TGACGGCACG(XGTGCGGGCCTG^^ 

PWADGTPCGPGHLCSEGSCLPEEEVE 

AtXCCCAAGCQXJTGGTAGATGGAGGCTGGGCACCGTGGGGACCCTGGGGAGMTGTTCTC^ 
RPKPVVOGGWAPWGPWGECSRTCGGG 

GTACAGTTTT(XACCGTGAGTGCAAGGACCCCGAGCCTCAGAATGGAGGAAGATACTG 
VQFSHRECKDPEPONGGRYCLGRRAK 

TACCAGTCATGCCACACGGAGGMTGCCCCGCTGAGXGAAA 

YQSCHTEECPPDGKSFREQQCEKYNA 

TACMTTACACTGACATGGA(XXX>MTCTCCTGCAGTGGGTCCCCAAGTATGCTGGGGTGTC 
YNYTOMOGNLLQWVPKYAGVSPRDRC 

AAGTTGTTCTGCCGAGCCCGGGGGAGGAGCGAGTTCAAAGTGTTCGAGGCCAAGGTGATTGATG^ 
KLFCRARGRSEFKVFEAKV I DGTLCG 

CC/^AAACACTGGCCATCTGTGTCCGTGGCCAGTGTGTCAAGGCGGGCTGTGACCArc 
PETLAI CVRGQCVKAGCDHVVDSPRK 

CTGGACAMTGCGGGGTGTGTGGtXXX^AAGGCA 

LDKCGVCGGKGNSCRKVSGSLTPTNY 
GGCTACAATGACATTGTCACCATCCCAGCTGGTGC^ 

GYNDIVT IPAGATNIDVKQRSHPGVQ 
AAGGATGGGAACTACCTGGGGCTGAAGACGCCTGATGGGCAGTA^ 

NOGNYLALKTAOGQYLLNGNLAI SA1 
GAGCAGGACATCTTGGTGAAHXIGACCATCCTGA^ 

EQD1LVKCT1LKYSGSIATLERLQSF 

CGGGGCTTGCCAGAGCCTCTGACAGTGCAGCTCCTGACAGTCCCTGGCGAGGTCTTCCCCCCAA 
RPLPEPL TVQLLTVPGEVF PPKVKYT 

TTCTTTGTTCCTMTGACGTGGACTTTAGCATGCAGAGCAGCAAAGAGAGAGCAACCACCAAC^ 
FFVPNDVDFSMQSSKERATTN I 1QPL 

CTCCAGGCACAGTGGGTGCTGGGGGACTGGTCTGAGTGCTCTAGCACCTGCGGGGCCGGC 
IHAQWVLGDWSECSSTCGAGWQRRTV 

GAGTG(XGGAOXCTCCKCCmCTCT(mCCT(X^CAAGGCTCTGAA^ 
ECRDPSGQASATCNKALKPEDAKPCE 

AG(X^TGTG(XXCCTGTGATTCAGGGGGGCAGGGGCCAGTCTTGTGCTCCTGGACATG 
S Q L C P I 

AAGGTCTCCACTGTGGTGACTGGGTCCCTTGGCWTAT^ 
(XnCCAGT/CTGCACAAATTCCTAAGGGGGAAGAGAAA 
NAATGGAACT7GCTCGGGTTCMTTAAAGGCATAAGTTAAAGTAAATTCATTATGATCAACAGA 
TTGCANGATACAACTANTAAAAAAAAAAAAAAAAAAAAAAAAAA - _ 



SUBSTITUTE SHEET (RULE 26) 



WO 99737660 



PCT/US99/01313 



6/19 



I — CS - — CM 

: — 3=2C— 3:3:-- 




CD 



CS|«— ro — O? 

ct» ro r-. CO CO 
rocsjro roron 



SUBSTITUTE SHEET (RULE 26) 



WO 99/37660 



PCT/US99/01313 



ES- xS r 5x r S 



CD CD CD 



Icdcdo! 



loool 



QCOu 



|oou| 



o CO . 



— » COOO CO GU 




cdcdcd 



Iq.cl.q.1 






|uoo| 





— CM «— CM = SZ* 



55%. 



|0- Q- Q-] 



CO CO O 

or 

CD CD 

<<: «< 
>- >- 





o 


1 1 




CD. 








1 1 < 






f 1 ^ 






1 1 CL_ 


1 >or> 


! 1 ^ 




1 1 CO 




LU 


I 1 > 


1 Q_ 




1 1 CO 


> > 




1 ! CD 






1 1 C_> 




H 


1 1 u_ 




^: 


u 1 CD 




I— CL QT 






CL. OIC 


— > 




1 — 1=> 




CO 


<c — =s 


Q_ Q_ 






or|a- 

— tu_ 


IS 


U.H-X 
CO 1 — CD 






Ui< — t 


z u-i 


o 





U_ 1 




«c < < 




OF 







CO CO CO 


CD CD 


CD 










> 


> 



[cd cd|co 




CD, °lxz^ 








a: 




COLaJOC 








CO *c *c 


CL. 




|oou 


f CO 




1— D>Q- 


>- ISO 


l|00| 




1 Ou. 


CO CO co | 




^9- 


cncsi 

00(0 0 

^» to 




CO 
I CD 
I CD 
CD 
CJ) 
I CL. 
I 

I CO 
I CD 
I Q_ 
I CO 

I ^ 
I ^ 
I 



— tO IT) 



1^.000 to ojro 



i >- 

I CD 

i > 

I CO 
I CD 
I (X) 

I LU 

I —I 
I > 



CO CD 
Os» ^- -rt- 

oo oo oo 



CD 

ro 
O 



SUBSTITUTE SHEET (RULE 26) 



WO 99/37660 PCT/US99/01313 



8/19 

o 



C£jO|CO 

z to a: 



CD 



ifs (Si 



CO CO CO 

ooo 

UJ LaJIQl. 



CD CO UJ 



UJ CO 



CJ 



I a: 



CO O CD 

< |uj uj| 



CD 
CO 



CO 



CO 

>- 
cc 

CO 



[Ml S 





oc 


CD 




CD 


u_ 




u_ 



CO 

DC 

CD 
CD 
Or 
CD 
OS 
CD 
DC 
CO 
CO 



zn co 



CO 



CO 



CO 



CM 
<7> 





& 


CD. 


P 


DC 




Q_ 


cc 




> 






UJ 
1 


UJ 

CD 






■— • 






—J 




CD 


UJ 




CL 


o 




Q_ 


— 




Q_ 






Or 


o 




5» 

UJ 


cc 










> 






i 


UJ 




»— 


>- 






Q_ 




> 


CO 




CO 


Q- 




—J 


Q£ 




»— 


QZ 




Q_ 


Q- 






— * 




UJ 


I 

•pr 

Q_ 




— • 


£X 




a 


ar 






> 




31 


UJ 




DC 


CD 




CD 


UJ 




CO 


— 1 




cu 


CD 




Q_ 


CL. 





UJ 
<C 


DC 




OC 


>- 




CD 


Q_ 




cd 


> 




> 


CD 






5> 




*— 






— » 








^" 






UJ 






CL. 




Q- 


IE 




co z 


CD 




IoIcdI 


UJ 




UJ CO 






1 DC 


«C 


U_ 


1 cd 


Z 


DC 


1 CD 




CD 


1 _J 


CO 




1 DC 


CO 




1 z 




UJ 


1 >- 


z 


DC 


<7> 

oo 

cd 
ct> — 


1139 





o 
o 



SUBSTITUTE SHEET (RULE 26) 



WO 99/37660 



PCT/US99/01313 



9/19 



«h-i ■ x/////je=± imm 

SP PRODOMAIN METALOPROTEASE CYS- TSP TSPTSP 

RICH 

*TH-2 ■ vzzzz^m urn 
p*i ■ vzzzzzm msm 

FIG.4 



SUBSTITUTE SHEET (RULE 26) 



WO 99/37660 



PCT/US99/01313 




SUBSTITUTE SHEET (RULE 26) 



WO 99/37660 



PCT/US99/01313 



11/19 

125 n 




FIG.6B 



SUBSTITUTE SHEET (RULE 26) 



WO 99/37660 



PCT/US99/01313 



12/19 



UJ 




0 3 5 10 15 30 



GST-METH1 fog/ml] 

FIG.6C 




GST-METH2 fa/ml] 

FIG.6D 

SUBSTITUTE SHEET (RULE 26) 



WO99AH660 



PCT/US99/01313 



13/19 




FIG.7B 



SUBSTITUTE SHEET (RULE 26) 



WO 99/37660 



PCTAJS99/01313 



14/19 




FIG.7D 



SUBSTITUTE SHEET (RULE 26) 



WO 99/37660 



PCT/US99/01313 




FIG.7E 



SUBSTITUTE SHEET (RULE 26) 




SUBSTITUTE SHEET (RULE 2d) 



WO 99/37660 



PCI7US99/01313 



17/19 



O 



cd 



-< 
c-> 

CD 

<c 
<c 
<c 
<c 
<c 



CD 



\ 

-CJ> N 

o K > 

... ^ 

I — v 
CD N 

> 

o 
«< 

I — 
CD 
I — 

«< 
CD 
<c 
-< 



CD 



CO 



CD 
<C 
CD 
CD 
<C 
CD 



cr> 



SUBSTITUTE SHEET (RULE 26) 



WO 99/37660 



PCT/US99/013I3 



18/19 



8 

CD 
O 
CC 
I 



1 

LO 

O 
CD 



o 

CO 
CD 
O 
C£ 
I 



O 

CD 
I 

CO 

2r 
o 

CD 
UJ 

C£ UJ LU 



I 

CO CO 



o 
to 

Z CD 

g i 



o 

CO 
CD 
CD 

=> =5? 



o o Q 
55 S g 



or 



or £ 



o 
o 



i 

CD 
Q_ 



cu 
o 



CD 

Or: CD 

UJ CXI 

CD UJ 

2: CD 

CO UJ 

UJ CO 

I UJ 
CO 



3 

:=> 

CD 
CO 



I UJ Wf 

CO I I 

z co co 

0 Z ZD 

5 9 i 

UJ CD rv? 

a u S 

CD , 

1 y ^ 

^ 3= z 

c? ft Q 

1 I g 



i=- CD 



Q_ 



CD 
CO 



I 



CD 

UJ 
CD 



I 

5 



O 



CD 



CO 



-S 

GO 

: cd 

"CD 

: to 



-CD 



ill 



<C CD CD 




* « U- 




SUBSTITUTE SHEET (RULE 26) 




SUBSTITUTE SHEET (RULE 26) 



WO 99/37660 



PCT/US99/01313 



-1- 

SEQUENCE LISTING 

<110> Iruela-Arispe, Luisa 
Hastings, Gregg A. 
Ruben, Steven M. 

<120> Methl and Meth2 Polynucleotides and Polypeptides 

<130> 1488.107PC02 

<140> 
<141> 

<150> 60/072,298 

<151> 1998-01-23 

<150> 60/098,539 
<151> 1998-08-28 

<160> 93 

<170> Patentln Ver. 2.0 

<210> 1 

<211> 3261 

<212> DNA 

<213> .Homo sapiens 

<220> 

<221> CDS 

<222> (1) . . (2853) 

<220> 

<221> UNSURE 
<222> (3095) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (3248) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (3255) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (3261) 

<223> May be any nucleic acid 



<400> 1 

atg ggg aac gcg gag egg get ccg ggg tct egg age ttt ggg ccc gta 



WO 99/37660 



PCT/US99/01313 



-2- 

Met Gly Asn Ala Glu Arg Ala Pro Gly Ser Arg Ser Phe Gly Pro Val 
1 5 10 15 

ccc acg ctg ctg ctg etc gec gcg gcg eta ctg gee gtg teg gac gca 96 
Pro Thr Leu Leu Leu Leu Ala Ala Ala Leu Leu Ala Val Ser Asp Ala 
20 25 30 



etc ggg cgc ccc tec gag gag gac gag gag eta gtg gtg ccg gag ctg 144 
Leu Gly Arg Pro Ser Glu Glu Asp Glu Glu Leu Val Val Pro Glu Leu 
35 40 45 

gag cgc gec ccg gga cac ggg acc acg cgc etc cgc ctg cac gec ttt 192 
Glu Arg Ala Pro Gly His Gly Thr Thr Arg Leu Arg Leu His Ala Phe 
50 55 60 

gac cag cag ctg gat ctg gag ctg egg ccc gac age age ttt ttg gcg 240 
Asp Gin Gin Leu Asp Leu Glu Leu Arg Pro Asp Ser Ser Phe Leu Ala 
65 70 75 80 



ccc ggc ttc acg etc cag aac gtg ggg cgc aaa tec ggg tec gag acg 288 
Pro Gly Phe Thr Leu Gin Asn Val Gly Arg Lys Ser Gly Ser Giu Thr 
85 90 95 

ccg ctt ccg gaa acc gac ctg gcg cac tgc ttc tac tec ggc acc gtg 336 
Pro Leu Pro Glu Thr Asp Leu Ala His Cys Phe Tyr Ser Gly Thr Val 
100 105 HO 

aat ggc gat ccc age teg get gec gec etc age etc tgc gag ggc gtg 384 
Asn Gly Asp Pro Ser Ser Ala Ala Ala Leu Ser Leu Cys Glu Gly Val 
115 120 125 

cgc ggc gec ttc tac ctg ctg ggg gag gcg tat ttc ate cag ccg ctg 432 
Arg Gly Ala Phe Tyr Leu Leu Gly Glu Ala Tyr Phe He Gin Pro Leu 
130 135 140 

ccc gec gec age gag cgc etc gee acc gee gec cca ggg gag aag ccg 480 
Pro Ala Ala Ser Glu Arg Leu Ala Thr Ala Ala Pro Gly Glu Lys Pro 
145 150 155 160 



ccg gca cca eta cag ttc cac etc ctg egg egg aat egg cag ggc gac 528 
Pro Ala Pro Leu Gin Phe His Leu Leu Arg Arg Asn Arg Gin Gly Asp 
165 170 175 



gta ggc ggc acg tgc ggg gtc gtg gac gac gag ccc egg ccg act ggg 
Val Gly Gly Thr Cys Gly Val Val Asp Asp Glu Pro Arg Pro Thr Gly 
180 185 190 



576 



aaa gcg gag acc gaa gac gag gac gaa ggg act gag ggc gag gac gaa 624 
Lys Ala Glu Thr Glu Asp Glu Asp Glu Gly Thr Glu Gly Glu Asp Glu 
195 200 205 



ggg cct cag tgg teg ccg cag gac ccg gca ctg caa ggc gta gga cag 



672 
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Gly Pro Gin Trp Ser Pro Gin Asp Pro Ala Leu Gin Gly Val Gly Gin 
210 215 220 

ccc aca gga act gga age ata aga aag aag cga ttt gtg tec agt cac 720 
Pro Thr Gly Thr Gly Ser He Arg Lys Lys Arg Phe Val Ser Ser His 
225 230 235 240 

cgc tat gtg gaa acc atg ctt gtg gca gac cag teg atg gca gaa ttc 768 
Arg Tyr Val Glu Thr Met Leu Val Ala Asp Gin Ser Met Ala Glu Phe 
245 250 255 



cac ggc agt ggt eta aag cat tac ctt etc acg ttg ttt teg gtg gca 
His Gly Ser Gly Leu Lys His Tyr Leu Leu Thr Leu Phe Ser Val Ala 
260 265 270 

gee aga ttg tac aaa cac ccc age att cgt aat tea gtt age ctg gtg 
Ala Arg Leu Tyr Lys His Pro Ser He Arg Asn Ser Val Ser Leu Val 
275 280 285 

gtg gtg aag ate ttg gtc ate cac gat gaa cag aag ggg ccg gaa gtg 
Val Val Lys He Leu Val He His Asp Glu Gin Lys Gly Pro Glu Val 
290 295 300 

acc tec aat get gee etc act ctg egg aac ttt tgc aac tgg cag aag 
Thr Ser Asn Ala Ala Leu Thr Leu Arg Asn Phe Cys Asn Trp Gin Lys 
305 310 315 320 

cag cac aac cca ccc agt gac egg gat gca gag cac tat gac aca gca 
Gin His Asn Pro Pro Ser Asp Arg Asp Ala Glu His Tyr Asp Thr Ala 
325 330 335 



816 



864 



912 



960 



1008 



att ctt ttc acc aga cag gac ttg tgt ggg tec cag aca tgt gat act 1056 
lie Leu Phe Thr Arg Gin Asp Leu Cys Gly Ser Gin Thr Cys Asp Thr 
340 345 350 

ctt ggg atg get gat gtt gga act gtg tgt gat ccg age aga age tgc 1104 
Leu Gly Met Ala Asp Val Gly Thr Val Cys Asp Pro Ser Arg Ser Cys 
355 360 365 

tec gtc ata gaa gat gat ggt tta caa get gee ttc acc aca gee cat 1152 
Ser Val He Glu Asp Asp Gly Leu Gin Ala Ala Phe Thr Thr Ala His 
370 375 380 

gaa tta ggc cac gtg ttt aac atg cca cat gat gat gca aag cag tgt 1200 
Glu Leu Gly His Val Phe Asn Met Pro His Asp Asp Ala Lys Gin Cys - 
385 390 395 400 

gee age ctt aat ggt gtg aac cag gat tec cac atg atg gcg tea atg 1248 
Ala Ser Leu Asn Gly Val Asn Gin Asp Ser His Met Met Ala Ser Met 
405 410 415 

ctt tec aac ctg gac cac age cag cct tgg tct cct tgc agt gee tac 1296 
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Leu Ser Asn Leu Asp His Ser Gin Pro Trp Ser Pro Cys Ser Ala Tyr 
420 425 430 

atg att aca tea ttt ctg gat aat ggt cat ggg gaa tgt ttg atg gac 
Met lie Thr Ser Phe Leu Asp Asn Gly His Gly Glu Cys Leu Met Asp 
435 440 445 

aag cct cag aat occ ata cag etc cca ggc gat etc cct ggc ace teg 
Lys Pro Gin Asn Pro lie Gin Leu Pro Gly Asp Leu Pro Gly Thr Ser 



1344 



1392 



450 



455 460 



tac gat gee aac egg cag tgc cag ttt aca ttt ggg gag gac tee aaa 1440 
Tyr Asp Ala Asn Arg Gin Cys Gin Phe Thr Phe Gly Glu Asp Ser Lys 
465 470 475 480 

eac tgc cct gat gca gee age aca tgt age ace ttg tgg tgt ace ggc 1488 
His Cys Pro Asp Ala Ala Ser Thr Cys Ser Thr Leu Trp Cys Thr Gly 
485 490 495 

acc tct ggt ggg gtg ctg gtg tgt caa aec aaa cae tte ecg tgg gcg 1536 
Thr Ser Gly Gly Val Leu Val Cys Gin Thr Lys H 1S Phe Pro Trp Ala 
500 505 510 

gat ggc acc age tgt gga gaa ggg aaa tgg tgt ate aac ggc aag tgt 1584 
Asp Gly Thr Ser Cys Gly Glu Gly Lys Trp Cys lie Asn Gly Lys Cys 
515 520 525 

gtg aac aaa ace gac aga aag eat ttt gat aeg cct ttt cat gga age 1632 
Val Asn Lys Thr Asp Arg Lys His Phe Asp Thr Pro Phe Hxs Gly Ser 
530 535 540 

tgg gga atg tgg ggg cct tgg gga gac tgt teg aga aeg tgc ggt gga 1680 
Trp Gly Met Trp Gly Pro Trp Gly Asp Cys Ser Arg Thr Cys Gly Gly 
545 550 555 560 

gga gtc cag tac aeg atg agg gaa tgt gac aac cea gtc cca aag aat 1728 
Gly Val Gin Tyr Thr Met Arg Glu Cys Asp Asn Pro Val Pro Lys Asn 
565 570 575 

gga ggg aag tac tgt gaa ggc aaa cga gtg cgc tac aga tec tgt aac 1776 
Gly Gly Lys Tyr Cys Glu Gly Lys Arg Val Arg Tyr Arg Ser Cys Asn 
580 585 590 

ett gag gac tgt cea gac aat aat gga aaa ace ttt aga gag gaa caa 1824 
Leu 111 Asp Cys Pro Asp Asn Asn Gly Lys Thr Phe Arg Glu Glu Gin 
595 600 605 

tgt gaa gca cae aac gag ttt tea aaa get tec ttt ggg agt ggg cct 1872 
Cvs Glu Ala His Asn Glu Phe Ser Lys Ala Ser Phe Gly Ser Gly Pro 
610 615 620 

gcg gtg gaa tgg att ece aag tac get ggc gtc tea cea aag gac agg 1920 
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Ala Val Glu Trp lie Pro Lys Tyr Ala Gly Val Ser Pro Lys Asp Arg 
625 630 635 640 

tgc aag etc ate tgc caa gee aaa ggc att ggc tac ttc ttc gtt ttg 
Cys Lys Leu He Cys Gin Ala Lys Gly He Gly Tyr Phe Phe Val Leu 
645 650 655 

cag ccc aag gtt gta gat ggt act cca tgt age cca gat tec acc tct 
Gin Pro Lys Val Val Asp Gly Thr Pro Cys Ser Pro Asp Ser Thr Ser 
660 665 670 

gtc tgt gtg caa gga cag tgt gta aaa get ggt tgt gat cgc ate ata 
Val Cys Val Gin Gly Gin Cys Val Lys Ala Gly Cys Asp Arg He He 
675 680 685 



1968 



2016 



2064 



gac tec aaa aag aag ttt gat aaa tgt ggt gtt tgc ggg gga aat gga 2112 
Asp Ser Lys Lys Lys Phe As P ' Lys Cys Gly Val Cys Gly Gly Asn Gly 
690 695 700 



tct act tgt aaa aaa ata tea gga tea gtt act agt gca aaa cct gga 
Ser Thr Cys Lys Lys He Ser Gly Ser Val Thr Ser Ala Lys Pro Gly 
705 710 ' 715 720 

tat cat gat ate ate aca att cca act gga gec acc aac ate gaa gtg 
Tvr His Asp He He Thr He Pro Thr Gly Ala Thr Asn He Glu Val 
725 730 735 



2160 



2208 



aaa cag egg aac cag agg gga tec agg aac aat ggc age ttt ctt gec 2256 
Lys Gin Arg Asn Gin Arg Gly Ser Arg Asn Asn Gly Ser Phe Leu Ala 
740 745 750 



ate aaa get get gat ggc aca tat att ctt aat ggt gac tac act ttg 
He Lys Ala Ala Asp Gly Thr Tyr He Leu Asn Gly Asp Tyr Thr Leu 



2304 



755 



760 765 



tec acc tta gag caa gac att atg tac aaa ggt gtt gtc ttg agg tac 2352 
Ser Thr Leu Glu Gin Asp He Met Tyr Lys Gly Val Val Leu Arg Tyr 
770 775 780 



age ggc tec tct gcg gca ttg gaa aga att cgc age ttt age cct etc 
Ser Gly Ser Ser Ala Ala Leu Glu Arg He Arg Ser Phe Ser Pro Leu 
785 790 795 800 

aaa gag ccc ttg acc ate cag gtt ctt act gtg ggc aat gec ctt cga 
Lys Glu Pro Leu Thr He Gin Val Leu Thr Val Gly Asn Ala Leu Arg 
805 810 815 

cct aaa att aaa tac acc tac ttc gta aag aag aag aag gaa tct ttc 
Pro Lys He Lys Tyr Thr Tyr Phe Val Lys Lys Lys Lys Glu Ser Phe 
820 825 830 

aat get ate ccc act ttt tea gca tgg gtc att gaa gag tgg ggc gaa 



2400 



2448 



2496 



254 4 
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Asn Ala He Pro Thr Phe Ser Ala Trp Val He Glu Glu Trp Gly Glu 
8 35 840 845 

tgt tct aag tea tgt gaa ttg ggt tgg cag aga aga ctg gta gaa tgc 2592 
Cys Ser Lys Ser Cys Glu Leu Gly Trp Gin Arg Arg Leu Val Glu Cys 
850 855 860 



cga gac att aat gga cag cct get tec gag tgt gca aag gaa gtg aag 
Arg Asp He Asn Gly Gin Pro Ala Ser Glu Cys Ala Lys Glu Val Lys 
865 870 875 880 

cca gec age acc aga cct tgt gca gac cat ccc tgc ccc cag tgg cag 
Pro Ala Ser Thr Arg Pro Cys Ala Asp His Pro Cys Pro Gin Trp Gin 
885 890 895 



2640 



2688 



ctg ggg gag tgg tea tea tgt tct aag acc tgt ggg aag ggt tac aaa 2736 
Leu Gly Gib Trp Ser Ser Cys Ser Lys Thr Cys Gly Lys Gly Tyr Lys 
900 905 910 



aaa aga age ttg aag tgt ctg tec cat gat gga ggg gtg tta tct cat 
Lys Arg Ser Leu Lys Cys Leu Ser His Asp Gly Gly Val Leu Ser His 
915 920 925 

gag age tgt gat cct tta aag aaa cct aaa cat ttc ata gac ttt tgc 
Glu Ser Cys Asp Pro Leu Lys Lys Pro Lys His Phe He Asp Phe Cys 
930 935 940 



2784 



2832 



aca atg gca gaa tgc agt taa gtggtttaag tggtgttagc tttgaggcaa 2883 
Thr Met Ala Glu Cys Ser 
945 950 

ggcaaagtga ggaagggctg gtgcagggaa agcaagaagg ctggagggat ccagcgtatc 2943 
ttgccagtaa ccagtgaggt gtatcagtaa ggtgggatta tgggggtaga tagaaaagga 3003 
gttgaatcat cagagtaaac tgccagttgc aaatttgata ggatagttag tgaggattat 3063 
taacctctga gcagtgatat agcataataa anccccgggc attattatta ttatttcttt 3123 
tgttacatct attacaagtt tagaaaaaac aaagcaattg tcaaaaaaaa aaaaaaaaaa 3183 
aaaaaaaaaa aaagggegge cgctctagag gatccctcga ggggcccaag ettacgegtg 3243 

3261 

catgntgtca tnagtctn 



<210> 2 

<211> 950 

<212> PRT 

<213> Homo sapiens 

<400> 2 



/ 
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Met Gly Asn Ala Glu Arg Ala Pro Gly Ser Arg Ser Phe Gly Pro Val 
1 5 10 15 

Pro Thr Leu Leu Leu Leu Ala Ala Ala Leu Leu Ala Val Ser Asp Ala 
20 25 30 

Leu Gly Arg Pro Ser Glu Glu Asp Glu Glu Leu Val Val Pro Glu Leu 



35 



40 



45 



Glu Arg Ala Pro Gly His Gly Thr Thr Arg Leu Arg Leu His Ala Phe 
50 



55 60 



Asp 
65 



Gin Gin Leu Asp Leu Glu Leu Arg Pro Asp Ser Ser Phe Leu Ala 
70 75 80 



Pro Gly Phe Thr Leu Gin Asn Val Gly Arg Lys Ser Gly Ser Glu Thr 
85 90 95 

Pro Leu Pro Glu Thr Asp Leu Ala His Cys Phe Tyr Ser Gly Thr Val 
100 105 11° 



Asn 



Gly Asp Pro Ser Ser Ala Ala Ala Leu Ser Leu Cys Glu Gly Val 



115 



120 125 



Arg Gly Ala Phe Tyr Leu Leu Gly Glu Ala Tyr Phe lie Gin Pro Leu 
130 135 I 40 

Pro Ala Ala Ser Glu Arg Leu Ala Thr Ala Ala Pro Gly Glu Lys Pro 
145 150 155 160 

Pro Ala Pro Leu Gin Phe His Leu Leu Arg Arg Asn Arg Gin Gly Asp 
165 l™ 175 

Val Gly Gly Thr Cys Gly Val Val Asp Asp Glu Pro Arg Pro Thr Gly 
180 185 19° 

Lys Ala Glu Thr Glu Asp Glu Asp Glu Gly Thr Glu Gly Glu Asp Glu 
195 200 205 

Gly Pro Gin Trp Ser Pro Gin Asp Pro Ala Leu Gin Gly Val Gly Gin 
210 215 220 

Pro Thr Gly Thr Gly Ser He Arg Lys Lys Arg Phe Val Ser Ser His 
225 230 235 240 

Arq Tyr Val Glu Thr Met Leu Val Ala Asp Gin Ser Met Ala Glu Phe 
245 250 255 

His Gly Ser Gly Leu Lys His Tyr Leu Leu Thr Leu Phe Ser Val Ala 
260 265 270 

Ala Arg Leu Tyr Lys His Pro Ser lie Arg Asn Ser Val Ser Leu Val 
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275 



280 



285 



Val Val Lys lie Leu Val He His Asp Glu Gin Lys Gly Pro Glu Val 
290 295 300 

Thr Ser Asn Ala Ala Leu Thr Leu Arg Asn Phe Cys Asn Trp Gin Lys 
305 310 315 320 

Gin His Asn Pro Pro Ser Asp Arg Asp Ala Glu His Tyr Asp Thr Ala 
325 330 335 

He Leu Phe Thr Arg Gin Asp Leu Cys Gly Ser Gin Thr Cys Asp Thr 
340 345 350 

Leu Gly Met Ala Asp Val Gly Thr Val Cys Asp Pro Ser Arg Ser Cys 
355 360 365 

Ser Val He Glu Asp Asp Gly Leu Gin Ala Ala Phe Thr Thr Ala His 
370 375 380 

Glu Leu Gly His Val Phe Asn Met Pro His Asp Asp Ala Lys Gin Cys 
385 390 395 400 

Ala Ser Leu Asn Gly Val Asn Gin Asp Ser His Met Met Ala Ser Met 
405 410 415 

Leu Ser Asn Leu Asp His Ser Gin Pro Trp Ser Pro Cys Ser Ala Tyr 
420 425 430 

Met He Thr Ser Phe Leu Asp Asn Gly His Gly Glu Cys Leu Met Asp 
435 440 445 

Lys Pro Gin Asn Pro He Gin Leu Pro Gly Asp Leu Pro Gly Thr Ser 
450 455 460 

Tyr Asp Ala Asn Arg Gin Cys Gin Phe Thr Phe Gly Glu Asp Ser Lys 
465 470 475 480 

His Cys Pro Asp Ala Ala Ser Thr Cys Ser Thr Leu Trp Cys Thr Gly 
485 490 495 

Thr Ser Gly Gly Val Leu Val Cys Gin Thr Lys His Phe Pro Trp Ala 
500 505 510 

Asp Gly Thr Ser Cys Gly Glu Gly Lys Trp Cys He Asn Gly Lys Cys 
515 520 525 

Val Asn Lys Thr Asp Arg Lys His Phe Asp Thr Pro Phe His Gly Ser 
530 535 540 

Trp Gly Met Trp Gly Pro Trp Gly Asp Cys Ser Arg Thr Cys Gly Gly 
545 550 555 560 
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Gly Val Gin Tyr Thr Met Arg Glu Cys Asp Asn Pro Val Pro Lys Asn 
565 570 575 

Gly Gly Lys Tyr Cys Glu Gly Lys Arg Val Arg Tyr Arg Ser Cys Asn 
580 585 590 

Leu Glu Asp Cys Pro Asp Asn Asn Gly Lys Thr Phe Arg Glu Glu Gin 
595 600 605 

Cys Glu Ala His Asn Glu Phe Ser Lys Ala Ser Phe Gly Ser Gly Pro 
610 615 620 

Ala Val Glu Trp He Pro Lys Tyr Ala Gly Val Ser Pro Lys Asp Arg 
625 ' 630 635 640 

Cys Lys Leu He Cys Gin Ala Lys Gly He Gly Tyr Phe Phe Val Leu 
645 650 655 



Gin Pro Lys Val Val Asp Gly Thr Pro Cys Ser Pro Asp Ser Thr Ser 
660 665 670 

Val Cys Val Gin Gly Gin Cys Val Lys Ala Gly Cys Asp Arg He He 



675 



680 



685 



Asp Ser Lys Lys Lys Phe Asp Lys Cys Gly Val Cys Gly Gly Asn Gly 
690 695 700 

Ser Thr Cys Lys Lys He Ser Gly Ser Val Thr Ser Ala Lys Pro Gly 
705 ' 710 715 720 

Tyr His Asp He He Thr He Pro Thr Gly Ala Thr Asn He Glu Val 
725 730 735 

Lys Gin Arg Asn Gin Arg Gly Ser Arg Asn Asn Gly Ser Phe Leu Ala 
740 745 750 

He Lys Ala Ala Asp Gly Thr Tyr He Leu Asn Gly Asp Tyr Thr Leu 
755 760 765 

Ser Thr Leu Glu Gin Asp He Met Tyr Lys Gly Val Val Leu Arg Tyr 
770 775 780 

Ser Gly Ser Ser Ala Ala Leu Glu Arg He Arg Ser Phe Ser Pro Leu 



785 



790 



795 



800 



Lys Glu Pro Leu Thr He Gin Val Leu Thr Val Gly Asn Ala Leu Arg 
805 810 815 

Pro Lys He Lys Tyr Thr Tyr Phe Val Lys Lys Lys Lys Glu Ser Phe 
820 825 830 
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Asn Ala He Pro Thr Phe Ser Ala Trp Val He Glu Glu Trp Gly Glu 
835 840 845 

Cys Ser Lys Ser Cys Glu Leu Gly Trp Gin Arg Arg Leu Val Glu Cys 
850 855 860 

Arq Asp He Asn Gly Gin Pro Ala Ser Glu Cys Ala Lys Glu Val Lys 
865 870 875 880 

Pro Ala Ser Thr Arg Pro Cys Ala Asp His Pro Cys Pro Gin Trp Gin 
885 890 895 

Leu Gly Glu Trp Ser Ser Cys Ser Lys Thr Cys Gly Lys Gly Tyr Lys 
900 905 910 

Lys Arg Ser Leu Lys Cys Leu Ser His Asp Gly Gly Val Leu Ser His 
915 920 925 

Glu Ser Cys Asp Pro Leu Lys Lys Pro Lys His Phe He Asp Phe Cys 
930 935 940 

Thr Met Ala Glu Cys Ser 
945 950 



<210> 3 

<211> 3008 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (1) - - (2670) 

<220> 

<221> UNSURE. 
<222> (2887) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (2957) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (2970) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (2981). 

<223> May be any nucleic acid 
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<400> 3 

atg ttc ccc gcc ccc gcc gcc ccc egg tgg ctt ccg ttc ctg ctg ctg 48 
Met Phe Pro Ala Pro Ala Ala Pro Arg Trp Leu Pro Phe Leu Leu Leu 
15 10 15 

ctg ctg ctg ctg ctg ctg ccg ctg gcc cgc ggc gcc ccg gcc egg ccc 96 
Leu Leu Leu Leu Leu Leu Pro Leu Ala Arg Gly Ala Pro Ala Arg Pro 
20 25 30 



gca gcc ggg ggg cag gcc teg gag ctg gtg gtg ccc acg egg ttg ccc 144 

Ala Ala Gly Gly Gin Ala Ser Glu Leu Val Val Pro Thr Arg Leu Pro 

35 40 45 

ggc age gcg ggc gag etc gcg etc cac ctg tec gcc ttc ggc aag ggc 192 

Gly Ser Ala Gly Glu Leu Ala Leu His Leu Ser Ala Phe Gly Lys Gly 

50 55 60 



ttc gtg ttg cgc ctg gcg ccc gac gac age ttc ctg gcg ccc gag ttc 240 
Phe Val Leu Arg Leu Ala Pro Asp Asp Ser Phe Leu Ala Pro Glu Phe 
65 70 75 80 



aag ate gag cgc etc ggg ggc tec ggc egg gcg ace ggg ggc gag egg 
Lys He Glu Arg Leu Gly Gly Ser Gly Arg Ala Thr Gly Gly Glu Arg 
85 90 95 



288 



ggg ctg cgc ggc tgt ttt ttt tec ggc ace gtg aat ggg gag ccc gag 336 
Gly Leu Arg Gly Cys Phe Phe Ser Gly Thr Val Asn Gly Glu Pro Glu 
100 105 HO 

teg ctg gcg gcg gtc age ctg tgc cgc ggg ctg age ggc tec ttc ctg 384 
Ser Leu Ala Ala Val Ser Leu Cys Arg Gly Leu Ser Gly Ser Phe Leu 
115 120 125 

ctg gac ggc gag gag ttc ace ate cag ccg cag ggc gcg ggg ggc tec 4 32 
Leu Asp Gly Glu Glu Phe Thr He Gin Pro Gin Gly Ala Gly Gly Ser 
130 135 140 

ctg get cag ccg cac cgc ctg cag cgc tgg ggt ccc gcc gga gcc cgc 4 80 
Leu Ala Gin Pro His Arg Leu Gin Arg Trp Gly Pro Ala Gly Ala Arg 
145 150 155 160 



ccc etc ccg cga gga ccc gag tgg gag gtg gag acg gga gag ggt cag 528 
Pro Leu Pro Arg Gly Pro Glu Trp Glu Val Glu Thr Gly Glu Gly Gin 
165 170 175 

agg cag gag aga gga gac cac cag gag gac age gag gag gag age caa 576 
Arg Gin Glu Arg Gly Asp His Gin Glu Asp Ser Glu Glu Glu Ser Gin 
180 185 190 



gaa gag gag gca gaa ggc get age gag ccg cca ccg ccc ctg ggg gcc 
Glu Glu Glu Ala Glu Gly Ala Ser Glu Pro Pro Pro Pro Leu Gly Ala 
195 200 205 



624 



WO 99/37660 



-12- 



PCT/US99/01313 



720 



768 



816 



864 



912 



acg agt agg acc aag egg ttt gtg tct gag gcg cgc ttc gtg gag acg 672 
Thr Ser Arg Thr Lys Arg Phe Val Ser Glu Ala Arg Phe Val Glu Thr 
210 215 220 

ctg ctg gtg gec gat gcg tec atg get gec ttc tac ggg gec gac ctg 
Leu Leu Val Ala Asp Ala Ser Met Ala Ala Phe Tyr Gly Ala Asp Leu 
225 230 235 240 

cag aac cac ate ctg acg tta atg tct gtg gca gec cga ate tac aag 
Gin Asn His He Leu Thr Leu Met Ser Val Ala Ala Arg He Tyr Lys 
245 250 255 

cac ccc age ate aag aat tec ate aac ctg atg gtg gta aaa gtg ctg 
His Pro Ser He Lys Asn Ser He Asn Leu Met Val Val Lys Val Leu 
260 265 270 

ate gta gaa gat gaa aaa tgg ggc cca gag gtg tec gac aat ggg ggg 
He Val Glu Asp Glu Lys Trp Gly Pro Glu Val Ser Asp Asn Gly Gly 
275 280 285 

ctt aca ctg cgt aac ttc tgc aac tgg cag egg cgt ttc aac cag ccc 
Leu Thr Leu Arg Asn Phe Cys Asn Trp Gin Arg Arg Phe Asn Gin Pro 
290 295 300 

age gac cgc cac cca gag cac tac gac acg gee ate ctg etc acc aga 960 
Ser Asp Arg His Pro Glu His Tyr Asp Thr Ala He Leu Leu Thr Arg 
305 310 315 320 

cag aac ttc tgt ggg cag gag ggg ctg tgt gac acc ctg ggt gtg gca 1008 
Gin Asn Phe Cys Gly Gin Glu Gly Leu Cys Asp Thr Leu Gly Val Ala 
325 330 335 

gac ate ggg acc att tgt gac ccc aac aaa age tgc tec gtg ate gag 
Asp He Gly Thr He Cys Asp Pro Asn Lys Ser Cys Ser Val He Glu 
340 345 350 

gat gag ggg etc cag gcg gee cac acc ctg gee cat gaa eta ggg cac 
Asp Glu Gly Leu Gin Ala Ala His Thr Leu Ala His Glu Leu Gly His 
355 360 365 

gtc etc age atg ccc cac gac gac tec aag ccc tgc aca egg etc ttc 
Val Leu Ser Met Pro His Asp Asp Ser Lys Pro Cys Thr Arg Leu Phe 
370 375 380 

ggg ccc atg ggc aag cac cac gtg atg gca ccg ctg ttc gtc cac ctg 
Gly Pro Met Gly Lys His His Val Met Ala Pro Leu Phe Val His Leu 
385 390 395 400 

aac cag acg ctg ccc tgg tec ccc tgc age gec atg tat etc aca gag 1248 ( 
Asn Gin Thr Leu Pro Trp Ser Pro Cys Ser Ala Met Tyr Leu Thr Glu 
405 410 415 



1056 



1104 



1152 



1200 
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ctt ctg gac ggc ggg cac gga gac tgt etc ctg gat gec cct ggt gcg 1296 
Leu Leu Asp Gly Gly His Gly Asp Cys Leu Leu Asp Ala Pro Gly Ala 
420 425 430 

gec ctg ccc etc ccc aca ggc etc ccg ggc cgc atg gec ctg tac cag 134 4 
Ala Leu Pro Leu Pro Thr Gly Leu Pro Gly Arg Met Ala Leu Tyr Gin 
435 440 445 

ctg gac cag cag tgc agg cag ate ttt ggg ccg gat ttc cgc cac tgc 1392 
Leu Asp Gin Gin Cys Arg Gin He Phe Gly Pro Asp Phe Arg His Cys 
450 455 460 



ccc aac acc tct get cag gac gtc tgc gee cag ctt tgg tgc cac act 
Pro Asn Thr Ser Ala Gin Asp Val Cys Ala Gin Leu Trp Cys His Thr 

470 475 480 



465 



1440 



1488 



gat ggg get gag ccc ctg tgc cac acg aag aat ggc age ctg ccc tgg 
Asp Gly Ala Glu Pro Leu Cys His Thr Lys Asn Gly Ser Leu Pro Trp 
485 490 495 

get gac ggc acg ccg tgc ggg cct ggg cac etc tgc tea gaa ggc age . 1536 
Ala Asp Gly Thr Pro Cys Gly Pro Gly His Leu Cys Ser Glu Gly Ser 
500 505 510 

tgt eta cct gag gag gaa gtg gag agg ccc aag ccc gtg gta gat gga 1584 
Cys Leu Pro Glu Glu Glu Val Glu Arg Pro Lys Pro Val Val Asp Gly 
515 520 525 

ggc tgg gca ccg tgg gga ccc tgg gga gaa tgt tct egg acc tgt gga 1632 
Gly Trp Ala Pro Trp Gly Pro Trp Gly Glu Cys Ser Arg Thr Cys Gly 
530 535 540 



1680 



gga gga gta cag ttt tea cac cgt gag tgc aag gac ccc gag cct cag 
Gly Gly Val Gin Phe Ser His Arg Glu Cys Lys Asp Pro Glu Pro Gin 
545 550 555 560 

aat gga gga aga tac tgc ctg ggt egg aga gec aag tac cag tea tgc 1728 
Asn Gly Gly Arg Tyr Cys Leu Gly Arg Arg Ala Lys Tyr Gin Ser Cys 
565 570 575 

cac acg gag gaa tgc ccc cct gac ggg aaa age ttc agg gag cag cag 1776 
His Thr Glu Glu Cys Pro Pro Asp Gly Lys Ser Phe Arg Glu Gin Gin 
580 585 590 

tgt gag aag tat aat gee tac aat tac act gac atg gac ggg aat etc 1824 
Cys Glu Lys Tyr Asn Ala Tyr Asn Tyr Thr Asp Met Asp Gly Asn Leu 
595 600 605 

ctg cag tgg gtc ccc aag tat get ggg gtg tec ccc egg gac cgc tgc 1872 
Leu Gin Trp Val Pro Lys Tyr Ala Gly Val Ser Pro Arg Asp Arg Cys 
610 615 620 
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aag ttg ttc tgc cga gcc egg ggg agg 
Lys Leu Phe Cys Arg Ala Arg Gly Arg 
625 630 

gcc aag gtg att gat ggc acc ctg tgt 
Ala Lys Val lie Asp Gly Thr Leu Cys 
645 



age gag ttc aaa gtg ttc gag 1920 
Ser Glu Phe Lys Val Phe Glu 
635 640 

ggg cca gaa aca ctg gcc ate 1968 
Gly Pro Glu Thr Leu Ala lie 
650 655 



tgt gtc cgt ggc cag tgt gtc aag gcc ggc tgt gac cat gtg gtg gac 2016 
Cys Val Arg Gly Gin Cys Val Lys Ala Gly Cys Asp His Val Val Asp 
660 665 670 



teg cct egg aag ctg gac aaa tgc ggg gtg tgt ggg ggc aaa ggc aac 2064 
Ser Pro Arg Lys Leu Asp Lys Cys Gly Val Cys Gly Gly Lys Gly Asn 
675 680 685 



tec tgc agg aag gtc tec ggg tec etc acc ccc acc aat tat ggc tac 2112 

Ser Cys Arg Lys Val Ser Gly Ser Leu Thr Pro Thr Asn Tyr Gly Tyr 
690 695 700 

aat gac att gtc acc ate cca get ggt gcc act aat att gac gtg aag 2160 

Asn Asp lie Val Thr lie Pro Ala Gly Ala Thr Asn lie Asp Val Lys 

705 710 715 720 



cag egg age cac ccg ggt gtg cag aac gat ggg aac tac ctg gcg ctg 2208 

Gin Arg Ser His Pro Gly Val Gin Asn Asp Gly Asn Tyr Leu Ala Leu 
725 730 735 

aag acg get gat ggg cag tac ctg etc aac ggc aac ctg gcc ate tct 2256 

Lys Thr Ala Asp Gly Gin Tyr Leu Leu Asn Gly Asn Leu Ala lie Ser 
740 745 750 

gcc ata gag cag gac ate ttg gtg aag ggg acc ate ctg aag tac age 2304 

Ala lie Glu Gin Asp lie Leu Val Lys Gly Thr lie Leu Lys Tyr Ser 

755 760 765 



ggc tec ate gcc acc ctg gag cgc ctg cag age ttc egg ccc ttg cca 2352 

Gly Ser lie Ala Thr Leu Glu Arg Leu Gin Ser Phe Arg Pro Leu Pro 
770. 775 780 

gag cct ctg aca gtg cag etc ctg aca gtc cct ggc gag gtc ttc ccc 2400 

Glu Pro Leu Thr Val Gin Leu Leu Thr Val Pro Gly Glu Val Phe Pro 

785 790 795 800 



cca aaa gtc aaa tac acc ttc ttt gtt cct aat gac gtg gac ttt age 2448 
Pro Lys Val Lys Tyr Thr Phe Phe Val Pro Asn Asp Val Asp Phe Ser 
805 810 815 



atg cag age age aaa gag aga gca acc acc aac ate ate cag ccg ctg 24 96 
Met Gin Ser Ser Lys Glu Arg Ala Thr Thr Asn lie lie Gin Pro Leu 
820 " 825 830 
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ctc cac gca cag tgg gtg ctg ggg gac tgg tct gag tgc tct age acc 254 4 

Leu His Ala Gin Trp Val Leu Gly Asp Trp Ser Glu Cys Ser Ser Thr 
835 840 845 

tgc ggg gec ggc tgg cag agg cga act gta gag tgc agg gac ccc tec 2592 

Cys Gly Ala Gly Trp Gin Arg Arg Thr Val Glu Cys Arg Asp Pro Ser 

850 . 855 860 

ggc cag gec tct gec acc tgc aac aag get ctg aaa ccc gag gat gec 264 0 

Gly Gin Ala Ser Ala Thr Cys Asn Lys Ala Leu Lys Pro Glu Asp Ala 
865 870 875 880 

aag ccc tgc gaa age cag ctg tgc ccc ctg tgattcaggg gggcaggggc 2690 
Lys Pro Cys Glu Ser Gin Leu Cys Pro Leu 
885 890 

cagtcttgtg ctcctggaca tgcggtactg aggtgeagae aaggtctcca ctgtggtgac 2750 

tgggtccctt ggecatatea aggcagcacg gcccacccag gcctcccatt gccgcaaccc 2810 

ctccagtact gcacaaattc ctaaggggga agagaaaagg tatggggegg caaaacctat 2870 

catcaactgt ccawtgnaat ggaacttget cgggttcaat taaaggcata agttaaagta 2930 

aattcattat gatcaacaga cctcacntca tctgttgcan gatacaacta ntaaaaaaaa 2990 

aaaaaaaaaa aaaaaaaa 3008 



<210> 4 
<211> 890 
<212> PRT 

<213> Homo sapiens 
<400> 4 

Met Phe Pro Ala Pro Ala Ala Pro Arg Trp Leu Pro Phe Leu Leu Leu 
15 10 15 

Leu Leu Leu Leu Leu Leu Pro Leu Ala Arg Gly Ala Pro Ala Arg Pro 
20 25 30 

Ala Ala Gly Gly Gin Ala Ser Glu Leu Val Val Pro Thr Arg Leu Pro 
35 40 45 

Gly Ser Ala Gly Glu Leu Ala Leu His Leu Ser Ma Phe Gly Lys Gly 
50 55 60 

Phe Val Leu Arg Leu Ala Pro Asp Asp Ser Phe Leu Ala Pro Glu Phe 
65 70 75 80 

Lys He Glu Arg Leu Gly Gly Ser Gly Arg Ala Thr Gly Gly Glu Arg 
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-16- 

90 



95 



Gly Leu Arg Gly Cys Phe Phe Ser Gly Thr Val Asn Gly Glu Pro Glu 
100 105 110 

Ser Leu Ala Ala Val Ser Leu Cys Arg Gly Leu Ser Gly Ser Phe Leu 
115 120 125 

Leu Asp Gly Glu Glu Phe Thr lie Gin Pro Gin Gly Ala Gly Gly Ser 
130 135 140 

Leu Ma Gin Pro His Arg Leu Gin Arg Trp Gly Pro Ala Gly Ala Arg 
145 150 155 160 

Pro Leu Pro Arg Gly Pro Glu Trp Glu Val Glu Thr Gly Glu Gly Gin 
165 170 175 

Arg Gin Glu Arg Gly Asp His Gin Glu Asp Ser Glu Glu Glu Ser Gin 
180 185 190 

Glu Glu Glu Ala Glu Gly Ala Ser Glu Pro Pro Pro Pro Leu Gly Ala 
195 200 205 

Thr Ser Arg Thr Lys Arg Phe Val Ser Glu Ala Arg Phe Val Glu Thr 
210 215 220 

Leu Leu Val Ala Asp Ala Ser Met Ala Ala Phe Tyr Gly Ala Asp Leu 
225 230 235 240 

Gin Asn His lie Leu Thr Leu Met Ser Val Ala Ala Arg lie Tyr Lys 
245 250 255 

His Pro Ser lie Lys Asn Ser lie Asn Leu Met Val Val Lys Val Leu 
260 265 270 

lie Val Glu Asp Glu Lys Trp Gly Pro Glu Val Ser Asp Asn Gly Gly 
275 280 285 

Leu Thr Leu Arg Asn Phe Cys Asn Trp Gin Arg Arg Phe Asn Gin Pro 
290 295 300 

Ser Asp Arg His Pro Glu His Tyr Asp Thr Ala lie Leu Leu Thr Arg 
305 310 315 320 

Gin Asn Phe Cys Gly Gin Glu Gly Leu Cys Asp Thr Leu Gly Val Ala 
325 330 335 



Asp lie Gly Thr lie Cys Asp Pro Asn Lys Ser Cys Ser Val lie Glu 
340 345 350 



Asp Glu Gly Leu Gin Ala Ala His Thr Leu Ala His Glu Leu Gly His 
355 360 365 
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Val Leu Ser Met Pro His Asp Asp Ser Lys Pro Cys Thr Arg Leu Phe 
370 * 375 380 

Gly Pro Met Gly Lys His His Val Met Ala Pro Leu Phe Val His Leu 
385 390 395 400 

Asn Gin Thr Leu Pro Trp Ser Pro Cys Ser Ala Met Tyr Leu Thr Glu 
405 410 415 

Leu Leu Asp Gly Gly His Gly Asp Cys Leu Leu Asp Ala Pro Gly Ala 
420 425 430 

Ala Leu Pro Leu Pro Thr Gly Leu Pro Gly Arg Met Ala Leu Tyr Gin 
435 440 445 

Leu Asp Gin Gin Cys Arg Gin lie Phe Gly Pro Asp Phe Arg His Cys 
450 455 460 

Pro Asn Thr Ser Ala Gin Asp Val Cys Ala Gin Leu Trp Cys His Thr 
465 470 475 480 

Asp Gly Ala Glu Pro Leu Cys His Thr Lys Asn Gly Ser Leu Pro Trp 
485 490 495 

Ala Asp Gly Thr Pro Cys Gly Pro Gly His Leu Cys Ser Glu Gly Ser 
500 505 510 

Cys Leu Pro Glu Glu Glu Val Glu Arg Pro Lys Pro Val Val Asp Gly 
515 520 525 

Gly Trp Ala Pro Trp Gly Pro Trp Gly Glu Cys Ser Arg Thr Cys Gly 
530 535 540 

Gly Gly Val Gin Phe Ser His Arg Glu Cys Lys Asp Pro Glu Pro Gin 
545 550 555 560 

Asn Gly Gly Arg Tyr Cys Leu Gly Arg Arg Ala Lys Tyr Gin Ser Cys 
565 570 575 

His Thr Glu Glu Cys Pro Pro Asp Gly Lys Ser Phe Arg Glu Gin Gin 
580 585 590 

Cys Glu Lys Tyr Asn Ala Tyr Asn Tyr Thr Asp Met Asp Gly Asn Leu 
595 600 605 



Leu Gin Trp Val Pro Lys Tyr Ala Gly Val Ser Pro Arg Asp Arg Cys 
610 615 620 



Lys Leu Phe Cys Arg Ala Arg Gly Arg Ser Glu Phe Lys Val Phe Glu 
625 630 635 640 
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Ala Lys Val lie Asp Gly Thr Leu Cys Gly Pro Glu Thr Leu Ala lie 
645 650 655 

Cys Val Arg Gly Gin Cys Val Lys Ala Gly Cys Asp His Val Val Asp 
660 665 670 

Ser Pro Arg Lys Leu Asp Lys Cys Gly Val Cys Gly Gly Lys Gly Asn 
675 680 685 

Ser Cys Arg Lys Val Ser Gly Ser Leu Thr Pro Thr Asn Tyr Gly Tyr 
690 695 700 

Asn Asp He Val Thr He Pro Ala Gly Ala Thr Asn He Asp Val Lys 
705 710 715 720 

Gin Arg Ser His Pro Gly Val Gin Asn Asp Gly Asn Tyr Leu Ala Leu 
725 730 735 

Lys Thr Ala Asp Gly Gin Tyr Leu Leu Asn Gly Asn Leu Ala He Ser 
740 745 750 



Ala He Glu Gin Asp He Leu Val Lys Gly Thr He Leu Lys Tyr Ser 
755 760 765 



Gly Ser lie Ala Thr Leu Glu Arg 
770 775 

Glu Pro Leu Thr Val Gin Leu Leu 
785 790 

Pro Lys Val Lys Tyr Thr Phe Phe 
805 

Met Gin Ser Ser Lys Glu Arg Ala 
820 



Leu Gin Ser Phe Arg Pro Leu Pro 
780 

Thr Val Pro Gly Glu Val Phe Pro 
795 800 

Val Pro Asn Asp Val Asp Phe Ser 
810 815 

Thr Thr Asn He He Gin Pro Leu 
825 830 



Leu His Ala Gin Trp Val Leu Gly Asp Trp Ser Glu Cys Ser Ser Thr 

835 840 845 

Cys Gly Ala Gly Trp Gin Arg Arg Thr Val Glu Cys Arg Asp Pro Ser 
850 855 860 

Gly Gin Ala Ser Ala Thr Cys Asn Lys Ala Leu Lys Pro Glu Asp Ala 
865 870 875 880 



Lys Pro Cys Glu Ser Gin Leu Cys Pro Leu 
885 890 



<210> 5 
<211> 1203 
<212> PRT 
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<213> Bovine 
<400> 5 

Met Asp Pro Pro Ala Gly Ala Ala Gly Arg Leu Leu Cys Pro Ala Leu 
1 5 10 15 

Leu Leu Leu Leu Leu Leu Pro Leu Pro Ala Asp Ala Arg Leu Ala Ala 
20 25 30 

Ala Ala Ala Asp Pro Pro Gly Gly Pro Gin Gly His Gly Ala Glu Arg 
35 40 45 

lie Leu Ala Val Pro Val Arg Thr Asp Ala Gin Gly Arg Leu Val Ser 
50 55 60 

His Val Val Ser Ala Ala Thr Ala Pro Ala Gly Val Arg Thr Arg Arg 
65 70 75 80 

Ala Ala Pro Ala Gin lie Pro Gly Leu Ser Gly Gly Ser Glu Glu Asp 
85 90 95 

Pro Gly Gly Arg Leu Phe Tyr Asn Val Thr Val Phe Gly Arg Asp Leu 
100 105 110 

His Leu Arg Leu Arg Pro Asn Ala Arg Leu Val Ala Pro Gly Ala Thr 
115 120 125 

Val Glu Trp Gin Gly Glu Ser Gly Ala Thr Arg Val Glu Pro Leu Leu 
130 135 140 

Gly Thr Cys Leu Tyr Val Gly Asp Val Ala Gly Leu Ala Glu Ser Ser 
145 150 155 160 

Ser Val Ala Leu Ser Asn Cys Asp Gly Leu Ala Gly Leu lie Arg Met 
165 170 175 

Glu Glu Glu Glu Phe Phe lie Glu Pro Leu Glu Lys Gly Leu Ala Ala 
180 185 190 

Lys Glu Ala Glu Gin Gly Arg Val His Val Val Tyr His Arg Pro Thr 
195 200 205 

Thr Ser Arg Pro Pro Pro Leu Gly Gin Ala Leu Asp Thr Gly lie Ser 
210 215 220 

Ala Asp Ser Leu Asp Ser Leu Ser Arg Ala Leu Gly Val Leu Glu Glu 
225 230 235 240 

Arg Val Asn Ser Ser Arg Arg Arg Met Arg Arg His Ala Ala Asp Asp 
245 250 255 



Asp Tyr Asn lie Glu Val 



Leu Leu Gly Val Asp Asp Ser Val Val Gin 
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260 



265 



270 



Phe His Gly Thr Glu His Val Gin Lys Tyr Leu Leu Thr Leu Met Asn 
275 280 285 

He Val Asn Glu He Tyr His Asp Glu Ser Leu Gly Ala His He Asn 
290 295 300 

Val Val Leu Val Arg He He Leu Leu Ser Tyr Gly Lys Ser Met Ser 
305 310 315 320 

Leu He Glu He Gly Asn Pro Ser Gin Ser Leu Glu Asn Val Cys Arg 
325 330 335 

Trp Ala Tyr Leu Gin Gin Lys Pro Asp Thr Asp His Asp Glu Tyr His 
340 345 , 350 

Asp His Ala He Phe Leu Thr Arg Gin Asp Phe Gly Pro Ser Gly Met 
355 360 365 

Gin Gly Tyr Ala Pro Val Thr Gly Met Cys His Pro Val Arg Ser Cys 
370 375 380 

Thr Leu Asn His Glu Asp Gly Phe Ser Ser Ala Phe Val Val Ala His 
385 390 395 400 

Glu Thr Gly His Val Leu Gly Met Glu His Asp Gly Gin Gly Asn Arg 
405 410 415 

Cys Gly Asp Glu Val Arg Leu Gly Ser He Met Ala Pro Leu Val Gin 
420 425 430 

Ala Ala Phe His Arg Phe His Trp Ser Arg Cys Ser Gin Gin Glu Leu 
435 440 445 

Ser Arg Tyr Leu His Ser Tyr Asp Cys Leu Arg Asp Asp Pro Phe Thr 
450 455 460 

His Asp Trp Pro Ala Leu Pro Gin Leu Pro Gly Leu His Tyr Ser Met 
465 470 475 480 

Asn Glu Gin Cys Arg Phe Asp Phe Gly Leu Gly Tyr Met Met Cys Thr 
485 490 495 

Ala Phe Arg Thr Phe Asp Pro Cys Lys Gin Leu Trp Cys Ser His Pro 
500 505 510 

Asp Asn Pro Tyr Phe Cys Lys Thr Lys Lys Gly Pro Pro Leu Asp Gly 
515 520 525 



Thr Met Cys Ala Pro Gly Lys His Cys Phe Lys Gly His Cys He Trp 
530 535 540 
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.Leu Thr Pro Asp lie Leu Lys Arg Asp Gly Asn Trp Gly Ala Trp Ser 
545 550 555 560 

Pro Phe Gly Ser Cys Ser Arg Thr Cys Gly Thr Gly Val Lys Phe Arg 
565 570 575 

Thr Arg Gin Cys Asp Asn Pro His Pro Ala Asn Gly Gly Arg Thr Cys 
580 585 590 

Ser Gly Leu Ala Tyr Asp Phe Gin Leu Cys Asn Ser Gin Asp Cys Pro 
595 600 605 

Asp Ala Leu Ala Asp Phe Arg Glu Glu Gin Cys Arg Gin Trp Asp Leu 
610 615 620 

Tyr Phe Glu His Gly Asp Ala Gin His His Trp Leu Pro His Glu His 
625 630 635 640 

Arg Asp Ala Lys Glu Arg Cys His Leu Tyr Cys Glu Ser Lys Glu Thr 
645 650 655 

Gly Glu Val Val Ser Met Lys Arg Met Val His Asp Gly Thr Arg Cys 
660 665 670 . 

Ser Tyr Lys Asp Ala Phe Ser Leu Cys Val Arg Gly Asp Cys Arg Lys 
675 680 685 

Val Gly Cys Asp Gly Val lie Gly Ser Ser Lys Gin Glu Asp Lys Cys 
690 695 700 

Gly Val Cys Gly Gly Asp Asn Ser His Cys Lys Val Val Lys Gly Thr 
705 710 715 720 

Phe Ser Arg Ser Pro Lys Lys Leu Gly Tyr lie Lys Met Phe Glu lie 
725 730 735 

Pro Ala Gly Ma Arg His Leu Leu He Gin Glu Ala Asp Thr Thr Ser 
740 745 750 

His His Leu Ala Val Lys Asn Leu Glu Thr Gly Lys Phe He Leu Asn 
755 760 765 

Glu Glu Asn Asp Val Asp Pro Asn Ser Lys Thr Phe He Ala Met Gly 
770 *775 780 

Val Glu Trp Glu Tyr Arg Asp Glu Asp Gly Arg Glu Thr Leu Gin Thr 
785 790 795 800 



Met Gly Pro Leu His Gly Thr He Thr Val Leu Val He Pro Glu Gly 
805 810 815 
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Asp Ala Arg lie Ser Leu Thr Tyr Lys Tyr Met lie His Glu Asp Ser 
820 825 830 

Leu Asn Val Asp Asp Asn Asn Val Leu Glu Asp Asp Ser Val Gly Tyr 
835 840 845 

Glu Trp Ala Leu Lys Lys Trp Ser Pro Cys Ser Lys Pro Cys Gly Gly 
850 855 860 

Gly Ser Gin Phe Thr Lys Tyr Gly Cys Arg Arg Arg Leu Asp His Lys 
865 870 875 880 

Met Val His Arg Gly Phe Cys Asp Ser Val Ser Lys Pro Lys Ala lie 
885 890 895 

Arg Arg Thr Cys Asn Pro Gin Glu Cys Ser Gin Pro Val Trp Val Thr 
900 905 910 

Gly Glu Trp Glu Pro Cys Ser Arg Ser Cys Gly Arg Thr Gly Met Gin 
915 920 925 

Val Arg Ser Val Arg Cys Val Gin Pro Leu His Asn Asn Thr Thr Arg 
930 935 940 

Ser Val His Thr Lys His Cys Asn Asp Ala Arg Pro Glu Gly Arg Arg 
945 950 955 960 

Ala Cys Asn Arg Glu Leu Cys Pro Gly Arg Trp Arg Ala Gly Ser Trp 
965 970 975 

Ser Gin Cys Ser Val Thr Cys Gly Asn Gly Thr Gin Glu Arg Pro Val 
980 985 990 

Leu Cys Arg Thr Ala Asp Asp Ser Phe Gly Val Cys Arg Glu Glu Arg 
995 1000 1005 

Pro Glu Thr Ala Arg lie Cys Arg Leu Gly Pro Cys Pro Arg Asn Thr 
1010 1015 1020 

Ser Asp Pro Ser Lys Lys Ser Tyr Val Val Gin Trp Leu Ser Arg Pro 
1025 1030 1035 1040 

Asp Pro Asn Ser Pro Val Gin Glu Thr Ser Ser Lys Gly Arg Cys Gin 
1045 1050 1055 

Gly Asp Lys Ser Val Phe Cys Arg Met Glu Val Leu Ser Arg Tyr Cys 
1060 1065 1070 

Ser lie Pro Gly Tyr Asn Lys Leu Cys Cys Lys Ser Cys Asn Pro His 
1075 1080 1085 

Asp Asn Leu Thr Asp Val Asp Asp Arg Ala Glu Pro Pro Ser Gly Lys 
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1090 1095 1100 

His Asn Asp He Glu Glu Leu Met Pro Thr Leu Ser Val Pro Thr Leu 
1105 1110 1115 1120 

Val Met Glu Val Gin Pro Pro Pro Gly He Pro Leu Glu Val Pro Leu 
1125 1130 1135 

Asn Thr Ser Ser Thr Asn Ala Thr Glu Asp His Pro Glu Thr Asn Ala 
1140 1145 1150 

Val Asp Val Pro Tyr Lys He Pro Gly Leu Glu Asp Glu Val Gin Pro 
1155 1160 1165 

Pro Asn Leu He Pro Arg Arg Pro Ser Pro Tyr Glu Lys Thr Arg Asn 
1170 1175 1180 

Gin Arg lie Gin Glu Leu lie Asp Glu Met Arg Lys Lys Glu Met Leu 
1185 1190 1195 1200 

Gly Lys Phe 



<210> 6 
<211> 50 
<212> PRT 

<213> Homo sapiens 
<400> 6 

Asp Asp Gly Trp Ser Pro Trp Ser Glu Trp Thr Ser Cys Ser Thr Ser 
1 5 10 15 

Cys Gly Asn Gly lie Gin Gin Arg Gly Arg Ser Cys Asp Ser Leu Asn 
20 25 30 

Asn Arg Cys Glu Gly Ser Ser Val Gin Thr Arg Thr Cys His lie Gin 
35 40 45 

Glu Cys 
50 



<210> 7 
<211> 57 
<212> PRT 

<213> Homo sapiens 
<400> 7 

Asp Gly Gly Trp Ser His Trp Ser Pro Trp Ser Ser Cys Ser Val Thr 
15 10 15 
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Cys Gly Asp Gly Val lie Thr Arg lie Arg Leu Cys Asn Ser Pro Ser 
20 25 30 

Pro Gin Met Asn Gly Lys Pro Cys Glu Gly Glu Ala Arg Glu Thr Lys 
35 40 45 

Ala Cys Lys Lys Asp Ala Cys Pro lie 
50 55 



<210> 8 

<211> 57 

<212> PRT 

<213> Homo sapiens 

<400> 8 

Asn Gly Gly Trp Gly Pro Trp Ser Pro Trp Asp lie Cys Ser Val Thr 
1 5 10 15 

Cys Gly Gly Gly Val Gin Lys Arg Ser Arg Leu Cys Asn Asn Pro Thr 
20 25 30 

Pro Gin Phe Gly Gly Lys Asp Cys Val Gly Asp Val Thr Glu Asn Gin 
35 40 45 

lie Cys Asn Lys Gin Asp Cys Pro He 
50 55 



<210> 9 
<211> 50 
<212> PRT 

<213> Homo sapiens 
<400> 9 

Glu Glu Gly Trp Ser Pro Trp Ala Glu Trp Thr Gin Cys Ser Val Thr 
15 10 15 

Cys Gly Ser Gly Thr Gin Gin Arg Gly Arg Ser Cys Asp Val Thr Ser 
20 25 30 

Asn Thr Cys Leu Gly Pro Ser He Gin Thr Arg Ala Cys Ser Leu Ser 
35 40 45 

Lys Cys 
50 



<210> 10 

<211> 57 

<212> PRT 

<213> Homo sapiens 
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<400> 10 

Asp Gly Gly Trp Ser His Trp Ser Pro Trp Ser Ser Cys Ser Val Thr 
15 10 15 

Cys Gly Val Gly Asn lie Thr Arg lie Arg Leu Cys Asn Ser Pro Val 
20 25 30 

Pro Gin Met Gly Gly Lys Asn Cys Lys Gly Ser Gly Arg Glu Thr Lys 
35 40 45 

Ala Cys Gin Gly Ala Pro Cys Pro lie 
50 55 



<210> 11 
<211> 56 
<212> PRT 

<213> Homo sapiens 
<400> 11 

Asp Gly Arg Trp Ser Pro Trp Ser Pro Trp Ser Ala Cys Thr Val Thr 
1 5 10 15 

Cys Ala Gly Gly lie Arg Glu Arg Thr Arg Val Cys Asn. Ser Pro Glu 
20 25 30 

Pro Gin Tyr Gly Gly Lys Ala Cys Val Gly Asp Val Gin Glu Arg Gin 
35 40 45 

Met Cys Asn Lys Arg Ser Cys Pro 
50 55 



<210> 12 
<211> 3974 
<212> DNA 

<213> Homo sapiens 
<400> 12 

ggtacctaag tgagtagggc gtccgatcga cggacgcctt ttttttgaat tcgtaatcat 60 
ggtcatagct gtttcctgtg tgaaattgtt atccgctcac aattccacac aacatacgag 120 
ccggaagcat aaagtgtaaa gcctggggtg cctaatgagt gagctaactc acattaattg 180 
cgttgcgctc actgcccgct ttccagtcgg gaaacctgtc gtgccagctg cattaatgaa 240 
tcggccaacg cgcggggaga ggcggtttgc gtattgggcg ctcttccgct tcctcgctca 300 
ctgactcgct gcgctcggtc gttcggctgc ggcgagcggt atcagctcac tcaaaggcgg 360 
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taatacggtt atccacagaa tcaggggata 
agcaaaaggc caggaaccgt aaaaaggccg 
cccctgacga gcatcacaaa aatcgacgct 
tataaagata ccaggcgttt ccccctggaa 
tgccgcttac cggatacctg tccgcctttc 
gctcacgctg taggtatctc agttcggtgt 
acgaaccccc cgttcagccc gaccgctgcg 
acccggtaag acacgactta tcgccactgg 
cgaggtatgt aggcggtgct acagagttct 
gaagaacagt atttggtatc tgcgctctgc 
gtagctcttg atccggcaaa caaaccaccg 
agcagattac gcgcagaaaa aaaggatctc 
ctgacgctca gtggaacgaa aactcacgtt 
caattcgcgc gcgaaggcga agcggcatgc 
acctttcgcg gtatggcatg atagcgcccg 
aaaccagtaa cgttatacga tgtcgcagag 
cgcgtggtga accaggccag ccacgtttct 
atggcggagc tgaattacat tcccaaccgc 
ttgctgattg gcgttgccac ctccagtctg 
gcgattaaat ctcgcgccga tcaactgggt 
agcggcgtcg aagcctgtaa agcggcggtg 
ctgatcatta actatccgct ggatgaccag 
aatgttccgg cgttatttct tgatgtctct 
tcccatgaag acggtacgcg actgggcgtg 
atcgcgctgt tagcgggccc attaagttct 
cataaatatc tcactcgcaa tcaaattcag 



-26- 

acgcaggaaa gaacatgtga gcaaaaggcc 420 
cgttgctggc gtttttccat aggctccgcc 480 
caagtcagag gtggcgaaac ccgacaggac 540 
gctccctcgt gcgctctcct gttccgaccc 600 
tcccttcggg aagcgtggcg ctttctcata 660 
aggtcgttcg ctccaagctg ggctgtgtgc 720 
ccttatccgg taactatcgt cttgagtcca 780 
cagcagccac tggtaacagg attagcagag 840 
tgaagtggtg gcctaactac ggctacacta 900 
tgaagccagt taccttcgga aaaagagttg 960 
ctggtagcgg tggttttttt gtttgcaagc 1020 
aagaagatcc tttgatcttt tctacggggt 1080 
aagggatttt ggtcatgaga ttatcgtcga 1140 
atttacgttg acaccatcga atggtgcaaa 1200 
gaagagagtc aattcagggt ggtgaatgtg 1260 
tatgccggtg tctcttatca gaccgtttcc 1320 
gcgaaaacgc gggaaaaagt ggaagcggcg 1380 
gtggcacaac aactggcggg caaacagtcg 1440 
gccctgcacg cgccgtcgca aattgtcgcg 1500 
gccagcgtgg tggtgtcgat ggtagaacga 1560 
cacaatcttc tcgcgcaacg cgtcagtggg 1620 
gatgccattg ctgtggaagc tgcctgcact 1680 
gaccagacac ccatcaacag tattattttc 174 0 
gagcatctgg tcgcattggg tcaccagcaa 1800 
gtctcggcgc gtctgcgtct ggctggctgg 1860 
ccgatagcgg aacgggaagg cgactggagt 1920 
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gccatgtccg gttttcaaca aaccatgcaa 
atgctggttg ccaacgatca gatggcgctg 
ctgcgcgttg gtgcggatat ctcggtagtg 
tatatcccgc cgttaaccac catcaaacag 
gaccgcttgc tgcaactctc tcagggccag 
tcactggtga aaagaaaaac caccctggcg 
ttggccgatt cattaatgca gctggcacga 
gcgcaacgca attaatgtaa gttagcgcga 
ccactcctgc agttcggggg catggatgcg 
gacggatttg cactgccggt agaactccgc 
ccaactcgcg aggggatcga gcccggggtg 
ctggaggatc atccagccgg cgtcccggaa 
ggcggcggtg gaatcgaaat ctcgtgatgg 
gaaccccaga gtcccgctca gaagaactcg 
gaatcgggag cggcgatacc gtaaagcacg 
tcttcagcaa tatcacgggt agccaacgct 
cggccacagt cgatgaatcc agaaaagcgg 
gcatcgccat gggtcacgac gagatcctcg 
aacagttcgg ctggcgcgag cccctgatgc 
ccggcttcca tccgagtacg tgctcgctcg 
caggtagccg gatcaagcgt atgcagccgc 
tcggcaggag caaggtgaga tgacaggaga 
cagtcccttc ccgcttcagt gacaacgtcg 
gccagccacg atagccgcgc tgcctcgtcc 
gtcttgacaa aaagaaccgg gcgcccctgc 
cagccgattg tctgttgtgc ccagtcatag 
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atgctgaatg agggcatcgt tcccactgcg 1980 
ggcgcaatgc gcgccattac cgagtccggg 204 0 
ggatacgacg ataccgaaga cagctcatgt 2100 
gattttcgcc tgctggggca aaccagcgtg 2160 
gcggtgaagg gcaatcagct gttgcccgtc 2220 
cccaatacgc aaaccgcctc tccccgcgcg 2280 
caggtttccc gactggaaag cgggcagtga 234 0 
attgtcgacc aaagcggcca tcgtgcctcc 2400 
cggatagccg ctgctggttt cctggatgcc 24 60 
gaggtcgtcc agcctcaggc agcagctgaa 2520 
ggcgaagaac tccagcatga gatccccgcg 2580 
aacgattccg aagcccaacc tttcatagaa 2640 
caggttgggc gtcgcttggt cggtcatttc 2700 
tcaagaaggc gatagaaggc gatgcgctgc 2760 
aggaagcggt cagcccattc gccgccaagc 2820 
atgtcctgat agcggtccgc cacacccagc 2880 
ccattttcca ccatgatatt cggcaagcag 2940 
ccgtcgggca tgcgcgcctt gagcctggcg 3000 
tcttcgtcca gatcatcctg atcgacaaga 3060 
atgcgatgtt tcgcttggtg gtcgaatggg 3120 1 
cgcattgcat cagccatgat ggatactttc 3180 
tcctgccccg gcacttcgcc caatagcagc 3240 
agcacagctg cgcaaggaac gcccgtcgtg 3300 
tgcagttcat tcagggcacc ggacaggtcg 3360 
gctgacagcc ggaacacggc ggcatcagag 3420 
ccgaatagcc tctccaccca agcggccgga 3480 



WO 99/37660 



PCT/US99/01313 



-28- 



gaacctgcgt gcaatccatc ttgttcaatc atgcgaaacg atcctcatcc tgtctcttga 3540 

tcagatcttg atcccctgcg ccatcagatc cttggcggca agaaagccat ccagtttact 3600 

ttgcagggct tcccaacctt accagagggc gccccagctg gcaattccgg ttcgcttgct 3660 

gtccataaaa ccgcccagtc tagctatcgc catgtaagcc cactgcaagc tacctgcttt 3720 

ctctttgcgc ttgcgttttc ccttgtccag atagcccagt agctgacatt catccggggt 3780 

cagcaccgtt tctgcggact ggctttctac gtgttccgct tcctttagca gcccttgcgc 3840 

cctgagtgct tgcggcagcg tgaagcttaa aaaactgcaa aaaatagttt gacttgtgag 3900 

cggataacaa ttaagatgta cccaattgtg agcggataac aatttcacac attaaagagg 3960 

agaaattaca tatg 3974 



<210> 13 

<211> 112 

<212> DNA 

<213> Homo sapiens 

<400> 13 

aagcttaaaa aactgcaaaa aatagtttga cttgtgagcg gataacaatt aagatgtacc 60 
caattgtgag cggataacaa tttcacacat taaagaggag aaattacata tg 112 



<210> 14 

<211> 542 

<212> DNA 

<213> Mus rausculus 

<220> 

<221> UNSURE 
<222> (3) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (21) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (22) 

<223> May be any nucleic acid 



<220> 

<221> UNSURE 
<222> (361) 

<223> May be any nucleic acid 



WO 99/37660 



-29- 



PCTAJS99/01313 



<220> 

<221> UNSURE 
<222> (369) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (407) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (427) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (479) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (482) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (535) 

<223> May be any nucleic acid 
<400> 14 

gtncgaattt cggcacgaga nnttagacgc 
cttggggaga ctgttcgaga acgtgcggtg 
acaacccagt cccaaagaat ggagggaagt 
cctgtaacct tgaggactgt ccagacaata 
aagcacacaa cgagttttca aaagcttcct 
ccaagtacgc tggcgtctca ccaaaggaca 
nttggctant tctttcgttt tgcagcccaa 
cagattncac ctttgtctgt gtgcaaggac 
cntagattcc aaaaggagtt ttgttaatgt 
aa 



cttttcatgg aagctgggga atgtgggggc 60 
gaggagtcca gtacacgatg agggaatgtg 120 
actgtgaagg caaacgagtg cgctacagat 180 
atggaaaaac ctttagagag gaacaatgtg 240 
ttgggagtgg gcctgcggtg gaatggattc 300 
ggtgcaagtt catgttgcca agccaaaggc 360 
ggttgttagg tgggtantcc atgttaggcc 420 
agtgtgttaa aagttggttg tgatccgcnt 4 80 
ggtgttttcn gggggaatgg tctantttta 540 

542 



<210> 15 
<211> 320 
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<212> DNA 
<213> Unknown 

<220> 

<223> Description of Unknown Organism: Unknown 
<400> 15 

cagagaacat tcgccccact cttcaatgac ccatgctgaa aaagtgggga tagcattgaa 60 
agattccttc ttcttcttta cgaagtaggt gtatttaatt ttaggtcgaa gggcattgcc 120 
cacagtaaga acctggatgg tcaagggctc tttgagaggg ctaaagctgc gaattctttc 180 
caatgccgca gaggagccgc tgtacctcaa gacaacacct ttgtacataa tgtcttgctc 240 
taaggtggac aaagtgtagt caccattaag aatatatgtg ccatcagcag ctttgatggc 300 
aagaaagctg cccttgttcc 320 

<210> 16 
<211> 316 
<212> DNA 

<213> Eimeria tenella 
<400> 16 

aatgccgaga cattaatgga cagcctgctt ccgagtgtgc aaaggaagtg aagccagcca 60 
gcaccagacc ttgtgcagac catccctgcc cccagtggca gctgggggag tggtcatcat 120 
gttctaagac ctgtgggaag ggttacaaaa aaagaagctt gaagtgtctg tcccatgatg 180 
gaggggtgtt atctcatgag agctgtgatc ctttaaagaa acctaaacat ttcatagact 240 
tttgcacaat ggcagaatgc agttaagtgg tttaagtggt gttagctttg agggcaaggc 300 
aaagtgagga agggct 316 



<210> 17 
<211> 383 
<212> DNA 

<213> Caenorhabditis elegans 
<220> 

<221> UNSURE 
<222> (160) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (326) 

<223> May be any nucleic acid 
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<220> 

<221> UNSURE 
<222> (358) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (366) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (377) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (379) 

<223> May be any nucleic acid 
<400> 17 

gtcgacccac gcgtccggat ggtactccat gtagcccaga ttccacctct gtctgtgtgc 60 
aaggacagtg tgtaaaagct ggttgtgatc gcatcataga ctccaaaaag aagtttgata 120 
aatgtggtgt ttgcggggga aatggatcta cttgtaaaan aatatcagga tcagttacta 180 
gtgcaaaacc tgggatatca tgatatcatc acaattccaa ctgggagcca ccaacatcga 240 
agtgaaacag cggaaccaga ggggatccag ggaacaatgg gcagctttct tgccatcaaa 300 
gctgctggat ggcacatata ttcttnaatg gtgactacac tttgtccacc ttagaganag 360 
acattntgtg acaaagngnt tgt 383 



<210> 18 
<211> 404 
<212> DNA 

<213> Crotalus atrox 



<220> 

<221> UNSURE 
<222> (21) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (301) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (335) 
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<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (373) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (378) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (382) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (383) 

<223> May be any nucleic acid 
<400> 18 

cccacgcgtc cgcccacggt nccgggactt gtgtgggtcc cagacatgtg atactcttgg 60 
gatggctgat gttggaactg tgtgtgatcc gagcagaagc tgctccgtca tagaagatga 120 
tggtttacaa gctgccttca ccacagccca tgaattaggc cacgtgttta acatgccaca 180 
tgatggatgc aaagcagtgt gccagcctta aatggtgtga accagggatt cccacatgat 240 
ggcgtcaatg ctttccaacc tgggaccaca gccagccttg ggtcctcctt gcagtggcct 300 
nacatggatt gacatcattt ctgggatgaa tggtncatgg gggaatgttt tgattggaca 360 
agccttcaga atnccctnac annttcccag gggttctccc tggg 404 



<210> 19 

<211> 152 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> UNSURE 
<222> (105) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (122) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (135) 
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<223> May be any nucleic acid 
<400> 19 

atcgtagaag atgaaaaatg gggcccagag gtgtccgaca atggggggct tacactgcgt 60 
aacttctgca actggcagcg gcgtttcaac cagcccagcg accgncaccc agagcactac 120 
gncacggcca tcctnctcac cagacagaac tt 152 

<210> 20 
<211> 4180 
<212> DNA 
<213> Unknown 

<220> 

<223> Description of Unknown Organism: Unknown 
<400> 20 

gcagctccga gctaggtgct atcgcaaggc cagagcgcac agcccggcgg agagagcaga 60 
tccttgctca gatcgagtca aatcgggcca aggcggagga cgaagagtcc aggctcctat 120 
tctggacttg ttccccagct ccgggggcgc ttctaggtcc tgcagcagcc agcagtgcgg 180 
agccaccaac tcggtgctgg aatgaaaaaa ttcccgcgcg ccagtgcaga atctttctaa 240 
gtgacccgga gcttcgggtg ctagctctgc acgaactttc ccatcaaagt gatcgtgaat 300 
tttaagcatc aggagcaggc cagcgaagct ctacgcgtct aaacgtctat ccagaccaag 360 
agttctctgc ggtgcagggt gcggtgccat gcagccaaaa gtcccttttg gggtcacgca 420 
agcagaagcc ctgctccgac atgggggacg tccagcgggc agcgagatct cggggctctc 480 
tgtccgcaca catgctgttg ctgctcctcg cttccataac aatgctgcta tgtgcgcggg 54 0 
gcgcacacgg gcgccccacg gaggaagatg aggagctggt cctgccctcg ctggagcgcg 600 
ccccgggcca cgattccacc accacacgcc ttcgtctgga cgcctttggc cagcagctac 660 
atctgaagtt gcagccggac agcggtttct tggcgcctgg cttcaccctg cagactgtgg 720 
ggcgcagtcc cgggtccgag gcacaacatc tggaccccac cggggacctg gctcactgct 780 
tctactctgg cacggtgaac ggtgatcccg gctctgccgc agccctcagc ctctgtgaag 840 
gtgtgcgtgg tgccttctac ctacaaggag aggagttctt cattcagcca gcgcctggag 900 
tggccaccga gcgcctggcc cctgccgtgc ccgaggagga gtcatccgca cggccgcagt 960 
tccacatcct gaggcgaagg cggcggggca gtggcggcgc caagtgcggc gtcatggacg 1020 



WO 99/37660 

acgagaccct gccaaccagc gactcgcgac 
ctgtgcggga ccccacgcct caggacgcgg 
agaagcgatt tgtgtccagc ccccgttatg 
tggccgactt ccacggcagc ggtctaaagc 
ccaggtttta caagcatccc agcattagga 
tggtcatata cgaggagcag aagggaccag 
ggaatttctg cagctggcag aaacaacaca 
atgacactgc aattctgttc accagacagg 
tcggaatggc agatgttgga accgtatgtg 
atgatggttt gcaagccgcc ttcaccacag 
cgcacgatga tgctaagcac tgtgccagct 
tggcctcgat gctctccagc ttagaccata 
tggtcacgtc cttcctagat aatggacacg 
caatcaagct cccttctgat cttcccggta 
ttacattcgg agaggaatcc aagcactgcc 
ggtgcactgg cacctccggt ggcttactgg 
atggcaccag ctgtggagaa gggaagtggt 
acatgaagca ttttgctact cctgttcatg 
actgctcaag aacctgtggt ggtggagttc 
tcccaaagaa cggagggaag tactgtgaag 
tcgaggactg tccagacaat aacggaaaaa 
atgagttttc caaagcttcc tttgggaatg 
ccggcgtctc gccaaaggac aggtgcaagc 
ttttcgtctt acagcccaag gttgtagatg 
tctgtgtgca agggcagtgt gtgaaagctg 
agtttgataa gtgtggcgtt tgtggaggaa 



PCT/US99/0I313 

-34- 

ccgagagcca gaacacccgg aaccagtggc 1080 
gaaagccatc aggaccagga agcataagga 1140 
tggaaaccat gctcgtagct gaccagtcca 1200 
attaccttct aaccctgttc tcggtggcag 1260 
attcaattag cctggtggtg gtgaagatct 1320 
aagttacctc caatgcagct ctcacccttc 1380 
acagccccag tgaccgggat ccagagcact 14 40 
atttatgtgg ctcccacacg tgtgacactc 1500 
accccagcag gagctgctca gtcatagaag 1560 
cccatgaatt gggccatgtg tttaacatgc 1620 
tgaatggtgt gagtggcgat tctcatctga 1680 
gccagccctg gtcaccttgc agtgcctaca 1740 
gggaatgttt gatggacaag ccccagaatc 1800 
ccttgtacga tgccaaccgc cagtgtcagt 1860 
ctgatgcagc cagcacatgt actaccctgt 1920 
tgtgccaaac aaaacacttc ccttgggcag 1980 
gtgtcagtgg caagtgcgtg aacaagacag 2040 
gaagctgggg accatgggga ccgtggggag 2100 
aatacacaat gagagaatgt gacaacccag 2160 
gcaaacgagt ccgctacagg tcctgtaaca 2220 
cgttcagaga ggagcagtgc gaggcgcaca 2280 
agcccactgt agagtggaca cccaagtacg 2340 
tcacctgtga agccaaaggc attggctact 2400 
gcactccctg tagtccagac tctacctctg 24 60 
gctgtgatcg catcatagac tccaaaaaga 2520 
acggttccac atgcaagaag atgtcaggaa 2580 



WO 99/37660 

tagtcactag tacaagacct gggtatcatg 
acattgaagt gaaacatcgg aatcaaaggg 
ttagagccgc tgatggtacc tatattctga 
aagacctcac ctacaaaggt actgtcttaa 
gaatccgcag ctttagtcca ctcaaagaac 
atgctctccg acccaaaatt aaattcacct 
acgccattcc cacattttct gagtgggtga 
gcggctcagg ttggcagaga agagtagtgc 
ccgaatgtgc aaaggaagtg aagccagcca 
cacactggca ggtgggggat tggtcaccat 
agagaacctt gaaatgtgtg tcccacgatg 
ctttgaagaa gccaaagcat tacattgact 
cgttagagga caaggtagcg tggggagggg 
gatccagtga gtcaaaccag taagcagtga 
catagcaaag gaggtagatc aggacactac 
atgaggcaca gtagcatctg aaagaccata 
agtatctctt ttcttatatc tatcgcccaa 
tgcactgtac taactagata cttcttatca 
ggtaagactg ggtttcaaac aaggcttggt 
aaacaagatc attattcgaa gtcgctggtt 
ttcctatcaa cagtgaaaag ttcagcttgt 
aagagctcct gatttcttct tacaccatct 
gaaacaattc atctatttat aaaatgtaca 
cacataaaaa ctgaaggaaa caatgagcaa 
aaacctgcct ggggattgag gttgtttaag 
taatacatgt tgtgccagag ttagaatggg 
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acattgtcac aattcctgct ggagccacca 2640 
ggtccagaaa caatggcagc tttctggcta 2700 
atggaaactt cactctgtcc acactagagc 2760 
ggtacagtgg ttcctcggct gcgctggaaa 2820 
ccttaaccat ccaggttctt atggtaggcc 2880 
actttatgaa gaagaagaca gagtcattca 2940 
ttgaagagtg gggggagtgc tccaagacat 3000 
agtgcagaga cattaacgga caccctgctt 3060 
gtaccagacc ttgtgcagac cttccttgcc 3120 
gttccaaaac ttgcgggaag ggttacaaga 3180 
ggggcgtgtt atcaaatgag agctgtgatc 3240 
tttgcacact gacacagtgc agttaagagg 3300 
ctgatacact gagtgcaaga gtactggagg 3360 
ggtgtggcaa ggaggtgtgt gtaggggata 3420 
cctgccagtt acattctgat aaggtagtta 3480 
cagagcacta aggagcccca aagcactatt 3540 
ataattttca gagtctggca gaagccctgt 3600 
caaagattgg gaaaggcaaa gcagaaagat 3660 
ttcaatcact ggaggcaagg aggaggggac 3720 
gctgtggttt tacggaaggt tgatgcatca 3780 
tcaacgtgac agaaaggctc atctccgtga 3840 
cagttcttaa ctatagttca tgttgaggta 3900 
ttggaaaaaa aaagtgaagt ttatgaggta 3960 
catgcctcct gctttgcttc ctcctgaggt 4020 
attatccatg gctcacaaga ggcagtaaaa 4080 
gtatagagat cagggtccca tgagatgggg 4140 
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aacatggtga tcactcatct cacatgggag gctgctgcag 4180 

<210> 21 
<211> 9248 
<212> DNA 
<213> Unknown 

<220> 

<223> Description of Unknown Organism: Unknown 
<400> 21 

gcagctccga gctaggtgct atcgcaaggc cagagcgcac agcccggcgg agagagcaga 60 
tcc'ttgctca gatcgagtca aatcggggcc aaggcggagg acgaagagtc caggctccta 120 
ttctggactt gttccccagc tccgggggcg cttctaggtc ctgcagcagc caggagtgcg 180 
gagccaccaa ctcggtgctg gaatgaaaaa attcccgcgc gccagtgcag aatctttcta 240 
agtgacccgg agcttcgggt gctagctctg cacgaacttt cccatcaaag tgatcgtgaa 300 
ttttaagcat caggagcagg ccagcgaagc tctacgcgtc taaacgtcta tccagaccaa 360 
gagttctctg cggtgcaggg tgcggtgcca tgcagccaaa agtccctttg gggtcacgca 420 
agcagaagcc. ctgctccgac atgggggacg tccagcgggc agcgagatct cggggctctc 480 
tgtccgcaca catgctgttg ctgctcctcg cttccataac aatgctgcta tgtgcgcggg 540 ^ 
gcgcacacgg gcgccccacg gaggaagatg aggagctggt cctgccctcg ctggagcgcg 600 
ccccgggcca cgattccacc accacacgcc ttcgtctgga cgcctttggc cagcagctac 660 
atctgaagtt gcagccggac agcggtttct tggcgcctgg cttcaccctg cagactgtgg 720 
ggcgcagtcc cgggtccgag gcacaacatc tggaccccac cggggacctg gctcactgct 780 
tctactctgg cacggtgaac ggtgatcccg gctctgccgc agccctcagc ctctgtgaag 840 
gtgtgcgtgg tgccttctac ctacaaggag aggagttctt cattcagcca gcgcctggag 900 
tggccaccga gcgcctggcc cctgccgtgc ccgaggagga gtcatccgca cggccgcagt 960 
tccacatcct gaggcgaagg cggcggggca gtggcggcgc caagtgcggc gtcatggacg 1020 
acgagaccct gccaaccagc gactcgcgac ccgagagcca gaacacccgg aaccagtggc 1080 
ctgtgcggga ccccacgcct caggacgcgg gaaagccatc aggtataaga gtgaccccca 1140 
tctctcagtc tttacgaggc gtgacttggg gtcacactcc agatcgcctc taaatgcgaa 1200 
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tgactcagac ttgcagtgaa ttgaagttct 
aaaaaaagtg tgaccatact ctgctagaac 
gaaagagaga aagaatcgga ggtcctgtag 
aataacccga ctccttacac tggagaatct 
gtatgagttc actttaacta ggttgtagtt 
tgagcacgta gcgctcaaag gagggcggag 
tcaggagcct gggcaacggc agtggtgatc 
cccccatttg aaaagctgac cccgatggct 
tgcttgggag gtctttgctt ggttcgcccc 
tgctcgcact ccctctcccg gcagaggttg 
gactgatcct ccctgctaca cattcgcctg 
ttagttcttt ttaatcattc aaatttcttt 
cgacttgtcc ctaggagctg gtattcatat 
ctcagcagcc cgaaaaaggc cattttccaa 
attccccccg gggcactggg agtgagagga 
ggtcctagtg cccggaactg gatcaagcag 
gctgagcctt acagactgag actcctcaag 
ccccatgcct tccacgtgga gctggatgat 
agtgaaaggg tgactcaggg cgtccgcctg 
gtagaaatga tagccatact gcctcactca 
aatctgcgga cacatatata gacagtttga 
ttttgtctta tttaaagcaa aagaaaaaag 
tgcaaatatt tgttttattt ttgttccagg 
ttgactttcc taatataccc cgagaaggtc 
acttcctttt ctttttacag gaccaggaag 
ccgttatgtg gaaaccatgc tcgtggctga 



gggtcgtgac cttcccgctc cccccccccc 1260 
acttatttgc ccgaatagtt aataatttga 1320 
ataagggcta agcgtttcct ccgcgaagcc 1380 
ctctccatcc ctttaatgcc tttagtgaat 1440 
tcgcgctgag ttttgtaacg tcagtccgtg 1500 
tagaggagcc atggtgacct ggatgtgcgt 1560 
tcatttctgt ggccttccgt ctgtcccctt 1620 
ggtggctccg ttgggcccct ctgcagaacc 1680 
gcctccacgc gcctcctacc tcggcctcgt 1740 
gactccccag cgctgtggaa tgttagcctg 1800 
actctgccgt gttcagtctc taccagccag 1860 
ttgccctttt ctagatttct ccctcttttc 1920 
cctactttac gatttctctg accgctgagt 1980 
attggcaacc ctggtttgag aaaggaactt 2040 
ggcaggaaaa cactgctggg cagagtgggt 2100 
agaaccccct gggacccctt gaatgagaga 2160 
ccccacccct tggctgagct ccccgccctg 2220 
ctcattcggg atttcagccc tggcttcaat 2280 
cttctcttgc caagttttta ctacagctgg 2340 
ggctgtggag tcttcaaaga ccacaaaaga 2400 
tcactctgtt gcttgctttg ttttgttttg 2460 
acttaaaaat aactcacagt ttttagaaga 2520 
tgtatttcag ttttatttac tttgactagg 2580 
actattagga gaaggactgc ccatgagcaa 2640 
cataaggaag aagcgatttg tgtccagccc 2700 
ccagtccatg gccgacttcc acggcagcgg 2760 
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tctaaagcat taccttctaa ccctgttctc ggtggcagcc aggttttaca agcatcccag 2820 
cattaggaat tcaattagcc tggtggtggt gaagatcttg gtcatatatg aggagcagaa 2880 
gggaccagaa gttacctcca atgcagctct cacccttcgg aatttctgca actggcagaa 2940 
acaacacaac agccccagtg accgggatcc agagcactat gacactgcaa ttctgttcac 3000 
cagacaggta agacaggagc ttatcaacca tttcatcaac tcaactcgga ggtcagcctt 3060 
gtgttggatg ggatgagagg gtgggggtgt ggcggagagg aaacccagaa ggggatgaca 3120 
tttgaaatgt aaacaaaata accaattaaa aaaaaaaggc atctcatctg tattgcctca 3180 
tttcctttcg gttataggct agctcaatct gtcttgctta tttctatttt aaacttccac 3240 
atctcaagtt ctacagttct attttaaaag cattacaggg aatcttgctt agagtcagtc 3300 
cttcaagccc agcaataatg aatggacagg cttcaaagtg catgtgaaga cacgcccaac 3360 
tgaagagcta agtatcactc tctcctactt aaaagggatt tcccttgcct ctttgtagga 3420 
tttatgtggc tcccacacgt gtgacactct cgggatggca gatgttggaa ctgtatgtga 3480 
ccccagcagg agctgctcag tcatagaaga tgatggtttg caagccgcct tcaccacagc 3540 
ccacgaattg ggtaagtcgg cttcagagta caagttaagc ccaaatgcat ggatacaacc 3600 
caataagtca atctgatgtg acgagagaga aaacatctca gactatgttg ctacctcagc 3660 
caccagcaat- tttagaaggg gtagggtata ttttccacga tttcaagtat ggtcttacta 3720 
ggacaggaga aagtggtaca aacatttgaa cgttgacatt tttatacttg ccctgatcaa 3780 
agtgagtatg agccccaata caggttgtct aataagagag ccattgagcc tcactcaata 3840 
atacagctga atgtccttct tgtctgcttc ccaggccatg tgtttaacat gccgcacgat 3900 
gatgctaagc actgtgccag cttgaatggt gtgactggcg attctcatct gatggcctcg 3960 
atgctctcca gcttagacca tagccagccc tggtcacctt gcagtgccta catggtcacg 4020 
tccttcctag ataatggaca cggtaagatg acagctcctc tttccagatg gtgttcaacc 4080 
ttccttgtgt agggctctct ctggctaagt gagctccatg gctcttgctc atttcccctc 4140 
cttcagagtt ttctctggca ggatcataag tagtagatct ttacctccat tgcatcctgc 4200 
tcccaaagtc cattcattca taaacaataa cttctcgcca ttgtaaaatc agaagtcccc 4260 
tattgaggat aacgtctcga taaaaatcta aagttcccta gcattgattt tcccaaaaat 4320 
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gcatgatttc accaaacatg tattaataat 
tattatttta ggggaatgtt tgatggacaa 
tcttcccggt accttgtacg atgccaaccg 
caagcactgc cctgatgcag ccagcacatg 
tggcttactg gtgtgccaaa caaaacactt 
agggaagtgg tgtgtcagtg gcaagtgcgt 
gagttttccc aatgaaacat atccgtttgc 
tttagttcct ttcctagaca aactcctcta 
tagcgtattt tgagaattga ttgtctgttt 
ttatagttct aacactgata tctgtatctc 
accatgggga ccgtggggag actgctcaag 
gagagaatgt gacaacccag tcccaaagaa 
ccgctacagg tcctgtaaca tcgaggactg 
ttcagctctc agaaaccggg caaaggcggc 
ctgggaacat catcgccgtc gttctctttt 
aggcgcacaa tgagttttcc aaagcttcct 
ccaagtacgc cggcgtctcg ccaaaggaca 
ttggctactt tttcgtctta cagcccaagg 
ggagcctcag ctgggcttgc tgccatgcca 
atctgtgttc cgttttgaat ttggaatact 
gatttgttgt tgccttctgt tgggaaggaa 
ttgcatgtgt ttaatattgc ttctctatcc 
ccctctctgt ataactaggt ttcacataca 
acatggcacc caggagtttt agaaagtagt 
ccgcacatac gagcacacta actaaggtaa 
acatgtactt tgaccaaaga gactcgccat 
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tgcctctttt ttcttttcct tttttttttt 4380 
gccccagaat ccaatcaagc tcccttctga 4440 
ccagtgtcag tttacattcg gagaggaatc 4500 
tactaccctg tggtgcactg gcacctccgg 4 560 
cccttgggca gatggcacca gctgtggaga 4 620 
gaacaagaca gacatgaagc attttgctgt 4 680 
aactcagggt tgagaagggc aaagtgatgg 4740 
cctgtgtcct gtagtgggac tatgagatgg 4 800 
tacatttttc tctgattccc taaaatgtct 4860 
catttagact cctgttcatg gaagctgggg 4 920 
aacctgtggt ggtggagttc aatacacaat 4980 
cggagggaag tactgtgaag gcaaacgagt 5040 
tccagacaat aacggtgagt catactggac 5100 
gtgccacaac atgtggttgg aagttggaaa 5160 
caggaaaaac gttcagagag gagcagtgcg 5220 
ttgggaatga gcccactgta gagtggacac 5280 
ggtgcaagct cacctgtgaa gccaaaggca 5340 
taggtgcttt tacacttgaa tctttgcaaa 5400 
tacaaatgtt tgggctgtct ttacctattg 5460 
tctaaatgca ggaacaactc cttgctttgg 5520 
gcttaaatct agctagcact taaaagagtc 5580 
ccaaagaatg gccctttgaa aactcaagag 5640 
aaaattcatg gttagataaa ttatatatta 5700 
ccaaagtact tgttactggg tacctagcag 5760 
gagtttgaga attaaaaatt catcgttgga 5820 
ttcttttggt gttttgcaga aaggataaat 5880 
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gaactgtatt tcctgtttat agacgtacta ataaaaaaga agttgatgat gtctttagtg 9060 
gtaagattgt tactaatgtg gttggcaaat tgctgtaaag agccagatag taagcattta 9120 
tggcattgta ggctatcttt cctgccacaa ccatgtgaca gtgagtgctt tgtaggactg 9180 
agagcagcca taaatgacat gtaaatgata aactgtggct gtgctttaat aaaactttat 9240 
ttacaaaa 9248 



<210> 22 
<211> 5722 
<212> DNA 
<213> Unknown 

<220> 

<223> Description of Unknown Organism: Unknown 



<400> 22 



qqacqcacag 


gcattccccg 


cgcccctcca 


gccctcgccg 


ccctcgccac cgctcccggc 


60 


cgccgcgctc 


cggtacacac 


aggatccctg 


ctgggcacca 


acagctccac catggggctg 


120 


gcctggggac 


taggcgtcqt 


gttcctgatg 


catgtgtgtg 


gcaccaaccg cattccagag 


180 


tctggcggag 


acaacagcgt 


gtttgacatc 


tttgaactca 


ccggggccgc ccgcaagggg 


240 


tctgggcgcc 


gactggtgaa 


gggccccgac 


ccttccagcc 


cagctttccg catcgaggat 


300 


gccaacctga 


tcccccctgt 


gcctgatgac 


aagttccaag 


acctggtgga tgctgtgcgg 


360 


gcagaaaagg 


gtttcctcct 


tctggcatcc 


ctgaggcaga 


tgaagaagac ccggggcacg 


420 


ctgctggccc 


tggagcggaa 


agaccactct 


ggccaggtct 


tcagcgtggt gtccaatggc 


480 


aaggcgggca 


ccctggacct 


cagcctgacc 


gtccaaggaa 


agcagcacgt ggtgtctgtg 


540 


gaagaagctc 


tcctggcaac 


cggccagtgg 


aagagcatca 


ccctgtttgt gcaggaagac 


600 


agggcccagc 


tgtacatcga 


ctgtgaaaag 


atggagaatg 


ctgagttgga cgtccccatc 


660 


caaagcgtct 


tcaccagaga 


cctggccagc 


atcgccagac 


tccgcatcgc aaaggggggc 


720 


gtcaatgaca 


atttccaggg 


ggtgctgcag 


aatgtgaggt 


ttgtctttgg aaccacacca 


780 


gaagacatcc 


tcaggaacaa 


aggctgctcc 


agctctacca 


gtgtcctcct cacccttgac 


840 


aacaacgtgg 


tgaatggttc 


cagccctgcc 


atccgcacta 


actacattgg ccacaagaca 


900 


aaggacttgc 


aagccatctg 


cggcatctcc 


tgtgatgagc 


tgtccagcat ggtcctggaa 


960 
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ctcaggggcc tgcgcaccat tgtgaccacg 
gagaacaaag agttggccaa tgagctgagg 
cagtacagaa ataacgagga atggactgtt 
tcagttacca tctgcaaaaa ggtgtcctgc 
cctgatggag aatgctgtcc tcgctgttgg 
ccatggtccg agtggacctc ctgttctacg 
cgctcctgcg atagcctcaa caaccgatgt 
cacattcagg agtgtgacaa aagatttaaa 
tggtcatctt gttctgtgac atgtggtgat 
tctcccagcc cccagatgaa tgggaaaccc 
tgcaagaaag acgcctgccc catcaatgga 
tgttctgtca cctgtggagg aggggtacag 
ccccagtttg gaggcaagga ctgcgttggt 
caggactgtc caattgatgg atgcctgtcc 
agctaccctg atggcagctg gaaatgtggt 
atccagtgca cagatgttga tgagtgcaaa 
ggagagcacc ggtgtgagaa cacggacccc 
ttcaccggct cacagccctt cggccagggt 
tgcaagcccc gtaacccctg cacggatggg 
aactacctgg gccactatag cgaccccatg 
ggcaatggca tcatctgcgg ggaggacaca 
gtgtgcgtgg ccaatgcgac ttaccactgc 
tcagggcagg aagactatga caaggatgga 
aatgataaaa ttccagatga cagggacaac 
gactatgaca gagatgatgt gggagaccgc 
gatcaggcag acacagacaa caatggggaa 



ctgcaggaca gcatccgcaa agtgactgaa 1020 
cggcctcccc tatgctatca caacggagtt 1080 
gatagctgca ctgagtgtca ctgtcagaac 1140 
cccatcatgc cctgctccaa tgccacagtt 1200 
cccagcgact ctgcggacga tggctggtct 1260 
agctgtggca atggaattca gcagcgcggc 1320 
gagggctcct cggtccagac acggacctgc 1380 
caggatggtg gctggagcca ctggtccccg 1440 
ggtgtgatca caaggatccg gctctgcaac 1500 
tgtgaaggcg aagcgcggga gaccaaagcc 1560 
ggctggggtc cttggtcacc atgggacatc 1620 
aaacgtagtc gtctctgcaa caaccccgca 1680 
gatgtaacag aaaaccagat ctgcaacaag 1740 
aatccctgct ttgccggcgt gaagtgtact 1800 
gcttgtcccc ctggttacag tggaaatggc 1860 
gaagtgcctg atgcctgctt caaccacaat 1920 
ggctacaact gcctgccctg ccccccacgc 1980 
gtcgaacatg ccacggccaa caaacaggtg 2040 
acccacgact gcaacaagaa cgccaagtgc 2100 
taccgctgcg agtgcaagcc tggctacgct 2160 
gacctggatg gctggcccaa tgagaacctg 2220 
aaaaaggata attgccccaa ccttcccaac 2280 
attggtgatg cctgtgatga tgacgatgac 2340 
tgtccattcc attacaaccc agctcagtat 2400 
tgtgacaact gtccctacaa ccacaaccca 2460 
ggagacgcct gtgctgcaga cattgatgga 2520 
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gacggtatcc tcaatgaacg ggacaactgc 
actgatatgg atggggttgg agatcagtgt 
cagctggact ctgactcaga ccgcattgga 
gaagatggcc accagaacaa tctggacaac 
gaccatgaca aagatggcaa gggagatgcc 
cctgatgaca aggacaactg cagactcgtg 
gatggtcgag gtgatgcctg caaagatgat 
gacatctgtc ctgagaatgt tgacatcagt 
cctctggacc ccaaagggac atcccaaaat 
aaagaactcg tccagactgt caactgtgat 
aatgctgtgg acttcagtgg caccttcttc 
ggatttgtct ttggctacca gtccagcagc 
acccagtcct actgggacac caaccccacg 
aaagttgtaa actccaccac agggcctggc 
ggaaacaccc ctggccaggt gcgcaccctg 
gatttcaccg cctacagatg gcgtctcagc 
gtgatgtatg aagggaagaa aatcatggct 
gctggtggta gactagggtt gtttgtcttc 
aaatacgaat gtagagatcc ctaatcatca 
aatgctggta ttgcaccttc tggaactatg 
cttggcttcc ttcttttctg tgcttgcatc 
tcaagaaaat gcagttttca aaaacagact 
atcttccaag catataaaca attgctttgg 
agttgggaag gtgcccattc cactctgcct 
catctctgag cagtggactc aaaagcattt 
agaattagca aacaaaacca ccctgacatc 



cagtacgtct acaatgtgga ccagagagac 2580 
gacaattgcc ccttggaaca caatccggat 2640 
gatacctgtg acaacaatca ggatattgat 2700 
tgtccctatg tgcccaatgc caaccaggct 2760 
tgtgaccacg atgatgacaa cgatggcatt 2820 
cccaatcccg accagaagga ctctgacggc 2880 
tttgaccatg acagtgtgcc agacatcgat 294 0 
gagaccgatt tccgccgatt ccagatgatt 3000 
gaccctaact gggttgtacg ccatcagggt 3060 
cctggactcg ctgtaggtta tgatgagttt 3120 
atcaacaccg aaagggacga tgactatgct 3180 
cgcttttatg ttgtgatgtg gaagcaagtc 3240 
agggctcagg gatactcggg cctttctgtg 3300 
gagcacctgc ggaacgccct gtggcacaca 3360 
tggcatgacc ctcgtcacat aggctggaaa 3420 
cacaggccaa agacgggttt cattagagtg 3480 
gactcaggac ccatctatga taaaacctat 3540 
tctcaagaaa tggtgttctt ctctgacctg 3600 
aattgttgat tgaaagactg atcataaacc 3660 
ggcttgagaa aacccccagg atcacttctc 3720 
agtgtggact cctagaacgt gcgacctgcc 3780 
catcagcatt cagcctccaa tgaataagac 3840 
tttccttttg aaaaagcatc tacttgcttc 3900 
ttgtcacaga gcagggtgct attgtgaggc 3960 
tcaggcatgt cagagaaggg aggactcact 4020 
ctccttcagg aacacgggga gcagaggcca 4080 
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aagcactaag gggagggcgc atacccgaga 
ttacatgttc ggtactaagt cattttcagg 
tgctgactgg cgttagctga ttaacccatg 
ggagacaaag actggcttct ggacttcctc 
agtggccaga attagggaat cagaatcaaa 
ctggtcacat tgaaattggt ggcttcattc 
aataggaaaa cctaccatct cagtgagcac 
cttatatttt tatggttaca atggcacaaa 
ttctcttttt tccgtaatta ctaggtagtt 
ttttaaagtc tttacgatgt aaaatattta 
aggattattc atggaacagg aagaagcgta 
cttcgtgact gtaagattgt aaatacagat 
taggcttcat acggaaagtg tttgagagca 
caagaacagc acaaggaaaa tcagtctaat 
atgttatggg attccttttt tctctgtttt 
atttgcaaat gttttaaatt gcaaagaaag 
tcccttgtgc atatttccag ggagaaggaa 
aaaagagaaa aaaatgacaa aaggtgaaac 
tgactgagta aagaattttt ggatcaagcg 
gctactgtag tacctaaaaa gtcagtgttg 
ttcccaataa ggaaatagca ttgaaatgtt 
tctatcatct ggtataccat tgctttattt 
tagaatattc agattgtgta gatatgctat 
agagttagta tttctatttt tatataatgt 
tttctttttt ttgttttttt tttttttttt 
ctatttgcca ataccttttt ctaggaatgt 



cgattgtatg aagaaaatat ggaggaactg 4140 
ggattgaaag actattgctg gatttcatga 4200 
taaataggca cttaaataga agcaggaaag 4260 
cctgatcccc acccttactc atcaccttgc 4320 
ccagtgtaag gcagtgctgg ctgccattgc 4380 
tagatgtagc ttgtgcagat gtagcaggaa 4440 
cagctgcctc ccaaaggagg ggcagccgtg 4500 
attattatca acctaactaa aacattcctt 4560 
ttctaattct ctcttttgga agtatgattt 4620 
ttttttactt attctggaag atctggctga 4680 
aagactatcc atgtcatctt tgttgagagt 4740 
tatttattaa ctctgttctg cctggaaatt 4800 
agtagttgac atttatcagc aaatctcttg 48 60 
aagctgctct gccccttgtg ctcagagtgg 4 920 
atcttttcaa gtggaattag ttggttatcc 4980 
ccatgaggtc ttcaatactg ttttacccca 5040 
agcatataca cttttttctt tcatttttcc 5100 
ttacatacaa atattacctc atttgttgtg 5160 
gaaagagttt aagtgtctaa caaacttaaa 5220 
tacatagcat aaaaactctg cagagaagta 5280 
aaatacaatt tctgaaagtt atgttttttt 5340 
ttataaatta ttttctcatt gccattggaa 54 00 
ttaaataatt tatcaggaaa tactgcctgt 54 60 
ttgcacactg aattgaagaa ttgttggttt 5520 
tttttttttg cttttgacct cccattttta 5580 
gctttttttt gtacacattt ttatccattt 5640 
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tacattctaa agcagtgtaa gttgtatatt actgtttctt atgtacaagg aacaacaata 5700 
aatcatatgg aaatttatat tt 5722 

<210> 23 
<211> 42521 
<212> DNA 
<213> Unknown 

<220> 

<223> Description of Unknown Organism: Unknown 
<400> 23 

gatcgttttc cagacatttt tgttctctgt tcatttcctt atcgtattca aaaagttatc 60 
acaaatgacc ttctctatct gtctgcgtct cttttaactc tcaccgtttg ggacctttca 120 
aatagttttt cgctatcaaa tctaaacatt agttgcgttg actcgacatt tgacccctca 180 
ctatcatctc cagttctctt ttttgttaca ctttagcagt ggcagcagag agcaagtagg 240 
tggagccaaa gtgtgcgcca ttcatcggga aaaattgtgt tcttcatcaa attttgggca 300 
atttactcgg gatttgcgct aatttggaaa caaaaattca aattcctgcc aattgttttg 360 
tttgcttttt ttcttttttt tttgctcctc ccatctctca tcaaattgct cttttttcga 420 
ttctaacata tcagccatct tcagagtgtg tcactaaccc ccatttttat tcaaggttag 480 
tgatatagta tcctaactac agacgtcaca ccatgaggtt gctgctcttc tcggcagccc 540 
ttcttctgtg ctccgtccca acgtgggcct tctctctgtc atcattcttc ggaagcgatg 600 
ttgcacaagt aagcaagctc tcctatacct agaatcttgt aaattgaaaa ctctaatttc 660 
cagaagccat accttcatcc aaactcccca ccggagcgtg acccggcgag ttccagaatg 720 
aagagacagg catatcaagt gtacgttgat ggagatgttt ccgttactgt tgacaagtct 780 
ggacaaaagg aaaccggcaa ctggggacca tgggtgcccg agaacgagtg ctcacgttcg 840 
tgtggtggag gagttcaact cgagaagaga cagtgcaggt tcgtggactt ttcatttttt 900 
agggaatttc ctagacgttc taaaagctta ttttcaaaaa ttttggtttc ctgatcttca 960 
tgcctttatg aacgtggtga aagatcaacc taggctagcc tgtgacatac attttttgaa 1020 
gcagatccaa ctttatcaag agccatcgaa ttctcgtttt aaagtgtttt ttttttctga 1080 
taactttttt ctaatagctt tacccatttt tatgtcaaga ctgaaagcaa tgaatcacaa 1140 
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gaggctatct acgtttgttt ttgaagctct gtaggaatca tcttaaaaaa ttaagtaaag 1200 
taatggagat gaaattctaa ttttttaaaa tcataatcat tactttctgt attatcttca 1260 
agttcaaact tttcaaacgg ttattctcaa gaaactcaca tagaatttta acaatttcct 1320 
ctatctattt cttgcaagca acccaccgaa ctcaaatctt atccaaacta aacttttagt 1380 
ggtgactgca ctggagcttc agtccgctac atctcgtgta acttgaacgc atgcgagtct 1.440 
ggtactgatt tccgtgctga gcaatgctcc aaattcaacg atgaggctct tgatggaaac 1500 
taccacaagt ggactccata caagggaaag aacaagtaag ttaactttct tcaagatgtt 1560 
tttctaattt tcgagttttc aggtgcgagc tcgtctgtaa gccagaatct ggaaacttct 1620 
actacaagtg ggctgataag gttgttgatg gaaccaagtg cgactccaag agcaacgata 1680 
tctgtgttga tggggaatgt cttccagttg gatgtgacgg aaagcttgga tcttgtaagt 1740 
ttaaaattta attcaaaatc ttcatttcat gccgaatatt tcagctctca aattcgacaa 1800 
gtgcggaaag tgcgatggag atggttctac ctgcaagact attgaaggac gtttcgatga 1860 
gcgcaatctc tctccaggat accatgatat tatcaaactt ccagaaggag ccaccaacat 1920 
taagattcag gaagccagaa agagcaccaa caacttggct ctgaagaacg gttccgatca 1980 
cttttatttg aatggaaatg gattgatcca agttgagaag gaggttgaag tcggaggaac 2040 
tatcttcgtt tacgatgacg ctgaaccaga aactctcagt gctcaaggac cactctccga 2100 
ggagctcacc gttgctcttc tcttcagaaa gggaagccgt gatactgcta tcaagtacga 2160 
gttctctatt ccacttgagg aggaagttga ctacatgtac aagtttgaca actggactcc 2220 
gtgctctgta tcatgcggaa agggtgttca aacccgtaat ctctactgta ttgatggaaa 2280 
gaacaaggga cgcgttgagg atgatctctg cgaggagaac aatgccacaa agccagagtt 2340 . 
cgaaaagagc tgtgaaactg ttgactgtga agccgaatgg ttcactggag actgggaatc 2400 
ttgctcatcc acctgcggag atcaaggaca gcaataccgt gtcgtctact gccatcaagt 2460 
attcgctaac ggacgtcgtg ttaccgttga ggatggaaac tgcaccgttg agagaccacc 2520 
agtaaagcag acttgcaatc ggtaagttga ttttataaat gcataaacaa ctctgtgaat 2580 
ctatttgttt atgcgatgct atccatatat attaccagat ggtgttggtg cccaaaactt 2640 
ataaacaatt attttctctt tgcagttttg cctgcccaga gtggcaagct ggtccgtggt 2700 
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cggcttgctc agagaagtgt ggagacgcct 
agaaggaagg agaagaggga aaactcttgg 
agaagttcga cacagagaga acttgcaatt 
ctggagaatg gaacttggtt agattttgca 
aataagatca actttatgaa acaaataatt 
agactcgtga agtcacctgc aaggactccc 
tggttgataa ctccaccgag attccaactg 
tgaacttgga ttaatttttt tcttccagat 
agtggaccgt cagtgagtgg agcaagtgta 
gtcgtgttat ctgtgccatc caccaaaacg 
gtcaagctga gaagccagaa ggaaagacta 
catggtacac atcttcatgg tccgagtgta 
gtgtcgctgt ttgcttgaac tacgataaga 
tcaagccatc tgagaaacaa gattgtaacg 
agttcggatg ctgcccagat aactctactt 
ctaactgctc ggaaacagaa ttcggatgct 
ctaactccaa gggatgcgaa gaattcgttg 
ccaatgctga cgctgaagct tcaggagatg 
acggagaagc tgttgatgtt gagtgtgcca 
atggggaact tatcggaaat gatactgatg 
ccgaattcgg atgctgtcca gattggtaca 
gcccatcgtt cactcttgga ggatgtaacg 
tcactcttgc tcgtggagcc aaccttgaag 
tctatggatg ctgtaaagat cgtaagacaa 
agcgatcatc cttcccatgt gagcttagcg 
ctgctcttgg aaagaatgga accggatgcg 



tccaatacag atcggtgacc tgccgcagtg 2760 
ccgctgatgc ttgcccagct gatgagcaag 2820 
tgggaccatg cgagggactt acatttgtca 2880 
aaatatgggg acctggggaa aagcatacta 2940 
tttagtgcac ccgctgcaac gatactgagg 3000 
aaggaagagc ctatccactc gagaagtgtt 3060 
atactaggtg agtcattcca gatatgacat 3120 
catgcgccac ccaaccacca tgtgagtacg 3180 
ccaccgaatg cggacacgga cacaagactc 3240 
gaggactcga ggttgttgat gaaggacact 3300 
actgcaccaa tgaggagaag tgtactggaa 3360 
ccgctgaatg tggtggtgga tcccaagatc 3420 
agccagttcc agaatggtgc gacgaagccg 3480 
ttgatgactg cccaacttgc gttgactctg 3540 
ttgctaccgg agaattcaac ttcggatgct 3600 
gtgctgacaa tgttaccgtt gccactggac 3660 
agtctccact taaccttgaa gctgatgttg 3720 
ctccagaact ctgcagcgtc acaaacgaga 3780 
ccattgctcc aatcactgct cttcttggag 3840 
cttccaatga gaccatacac tgctcgaaga 3900 
ccgccgcctc tggaaagggt aacgaaggat 3960 
agactcaatt cggatgttgt cacgatgatg 4020 
gatgcggaga gccatcttgc gctgcttccc 4080 
ttgccttcgg accacactat tctggatgtg 4140 
acttcggatg ctgcccagat ggtgagactg 4200 
gagagaactg cttgaccacc aagttcggat 4260 
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gctgccctga tggaaagacc accgccaagg 
tcgcccaata cggatgctgc ccagacggaa 
gatgcccaga aagctgcgcc cagagccagt 
ctcgcggaga gaacaaggaa ggatgtccat 
atggggagac tactgctctt ggaccacgca 
agcacggatg ttgcccagat ggagagacca 
caccaactac cacgccacca ttcctcatgg 
cctgtaatca gacacaagaa agtggaaccg 
attaacctca tgaaaaagaa ttggagcaac 
gcatggcatt atgataccac tgagggacgt 
ggaaatgaca acaactttgc tagccaggat 
ggcaagggaa gatgttacct gccacgtgtt 
ccaagatact attatgatca ttccaagaag 
ctcggaaatg ccaacaactt caactctttc 
ggaccgtacg atgctccaac caccgctgct 
caataccttc caactccaga agttcaacag 
ccacaacagc cacaacaaca gcaacagcaa 
tcaatggaag acatctgcag atcccgccaa 
caatggttct acaacgcttt cagccaagaa 
ggaaatctca atcgtttccg cagcaaggat 
ggagctcagc catccgctgc ccggcaggaa 
ccagctcagc caagtaacat cgtctctcca 
ccatgtaagt tctttagaat gcatttattt 
gaagcatccc catttcagcg aacagcaaac 
aaggacgttg taagggggct tttgactcct 
tcacattcaa gtacaccgga tgcggaggaa 



ggtcccacaa cgagggatgc ggatgcgagt 4320 
aatcagttgc caagggagcc ggattttacg 4380 
tcggatgctg cccagacgga aagactcgtg 4440 
gccagtacac ccgttacgga tgctgcccag 4500 
atgatggatg tgataactgc cgctacgcca 4560 
aggctcttgg accagatgga gccggatgcc 4620 
gaggaactgt tgccccacat aaaatcgccg 4 680 
tctgcggagc cggatacaag cttgtaagta 4740 
acatttcatg tataaatatt tcaatttcag 4800 
tgcaaccagt tctggtacgg aggatgcggt 4860 
atgtgcgaga ctatctgcgt cgaaccacca 4 920 
gatggaccac tccggtgtga ccaacttcag 4 980 
cactgtgtgg ccttctggtg gagaggatgt 5040 
gaagaatgct ccatgttctg taaggacgtt 5100 
ccaccaccac caccacagca aaatgctcag 5160 
attgagattc aatctgctga gcaacctcaa 5220 
caacagcaac aaccacagca accacgtcaa 5280 
gacgccggac catgcgagac ttactccgat 5340 
tgcgaaacct tcacttatgg aggatgtgga 5400 
gaatgcgagc agcgttgttt cttcgttcac 54 60 
caagctcagc cagcagctca accagctcaa 5520 
ccacaacagt cagctagtcc agttgtggtt 5580 
cttactataa gtttctataa gttcgcatgt 564 0 
aacgcgatgc ttgccacctc aacgttgacc 5700 
ggtactacga agttgccacc ggatcctgcg 5760 
acgccaacag atttgctagc aaggatcagt 5820 
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gcgagtcact ctgtgtgaag ccagcttctg 
gagttataga gaatgttcac tatttttgtt 
gtttgaaaat gtttgcacca tgtttcaaaa 
aaaatcttaa ttacactcca gaagtacatt 
aaaaaggttt aatattgagg tttcaaatta 
aaactgattt ttaaaatctt atttttgaag 
ccaacttttt cagacggtgc agctggaatc 
ggaccgtgca ccaactttgt cacgaaatgg 
cgattccatt acggtggatg ccaaggaacc 
aaggctgctt gtcaaaatca taaggatgct 
tctggaaagc attcctatta ttactacaac 
tatggtggct gcctcggaaa tactaacaga 
tgcccgagta agttctaagt taatagtgat 
aattttcaaa tactttttgc ataattacct 
ctccacccgc tacaaattgt ttcccgtact 
acacggcaat gctgcctaaa tgaactgcct 
aaaaaaaaca attctctcaa aaaattccct 
gttgcgctat tttttttaaa taaatgaata 
gcatgtaagt ttgtagtgta tcagtgtgtt 
tttcctgaac tgcatgaaac tgttcttatt 
gtgtgatttt gtttcagaag acgaccagac 
aagtttgcca cttgttcaag aagatcctca 
acgtgtagtc caagtgccta cttctcgtat 
tattaaggaa tcagtagcat gtaaattgtg 
ttctcactca caatcagata tggagtagct 
tgtcataaca tccaaattta attattaaaa 



aagctgcttc agccggaatt ggtatgcttt 5880 
aaatgtttga gtaaatgaga aactggctca 5940 
tagtttttga gttgaatagt tgaggccatg 6000 
ttaaaacatt tttgagaatt aggtcttcaa 6060 
gaaatattaa tatacgggga tttgggttta 6120 
tttcgctttg atattcgtgc aaaaaaaaaa 6180 
aactcagttt gtgacgaagc caaggacacc 6240 
tactacaaca aagccgacgg aacctgcaac 6300 
aacaatcgat tcgacaacga gcaacagtgc 6360 
tgtcaacttc caaaggttca aggaccatgc 6420 
actgccagtc atcaatgcga gacgttcact 6480 
ttcgctacca ttgaggagtg tcaagcgaga 654 0 
atatgctttg tttccccttt attctttgac 6600 
tatttctatt cccttctgtt tcccattttc 6660 
ctctcctttc tcactttccc gtccgaaggg 6720 
aataatattt atgaattttc caattttcta 6780 
gccgttccgc cactgctttc ttcacccatt 6840 
aagctgaaat agttaacagt ttctgaaatt 6900 
tgtcgtgaaa gttttttttt acctgcatga 6960 
acgttttaga tttgctgaag tgtgctagaa 7020 
tacaacaaca tcacaaccag aagagctccc 7080 
gccacgaccg gcattttcat tgaagtaagc 7140 
gaccaaaaaa tttaatataa ggtttccaag 7200 
tggattgttc tcctgggttg atgggttttt 7260 
tatatgggaa tttatttgag aaatagaata 7320 
agttgtgaag tttctcatta tgtatataaa 7380 
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attcgccttt caaataagaa caaaaattaa 
ttgagaaaat aactggttca aaaagaagaa 
ggattttctt ttcaggtcgg ggaaatttcg 
aaatatacac caaaaattat ttttaccata 
agtattcatg taaggttact gtattgttac 
agttgtaaga agtatagaga ttactgtaga 
aataatctgt ggggatagaa tgttgaaggc 
ttttaacagt gattttttta agcatatcct 
ttcccactaa actctttaag acaatccttc 
gcacgttccg tatccgcccg tcaccatact 
tatgctgttc cagatccagg atcttgcggg 
taccgtctta atgattcaga gactaccgtc 
catgccgtca attctactat ggtggatgtg 
ataaatgtga aacatcgtgt gttgctaaga 
cttcaaaatc tctggaagag gttagactaa 
atcatgatcc agaagttgat caaatcgaag 
gagctctacc tgaattatgc atgcttccag 
tgagatggag gtaagtcaaa tcaagaatag 
tatactaaaa gcaaaatctt aaaatctttc 
accttcatgt attctggatg taatccaaat 
attattttgt gtcctttcgt ggaactggcc 
gaatacccaa tttgagcacc gctggctcac 
cggcctctgt attttctgag cactcttgag 
ttcaagaggg agtgagagag agaaataaac 
tagaagtcga tgccgttctg cagtatcctt 
ttatggatta ggaactaaga aatcatcact 



ctgtatgaaa gagctgaatt caatttgaaa 7440 
aaacgttgga aaatctagac. gtaaatctat 7500 
acgattttta tattttcaaa aatcattcac 7560 
ataaaatacg gaatttcact ggattactgt 7620 
tctagggata ctacaagaat atttttgcaa 7680 
ttgaaaatct agacaaaaat cattttccgt 7740 
acaaggctta taaagcacca tgggaaaaaa 7800 
ctttcccagg aaatccactt ttcaaatata 7860 
gcccatagtc gtcgccgtga tgctccattt 7920 
cctgattccg aagaggaacg agttgactgt 7980 
taataaatct cacctatcca ttacaaccat 8040 
ttgtttggca ctactctgcc acgagtaact 8100 
ctgggaatac gaatcgcttc gagacccggg 8160 
ttgaagaacg cgtggaaagt gtgtcagaag 8220 
cggatccaag gatggattct cactttggat 8280 
aagaagctga atatgtcatt gttgataccg 8340 
aacaaagagg gtcttgttat gataacattt 8400 
aaaattcgaa aatccgaaaa actttataat 84 60 
agattcgact ctgaaaagtc tcaatgtgta 8520 
gcaaatcact tcactagtca ggttagtttc 8580 
ccttggtttc taacttgatc ttctccttcc 8640 
tttttcgacg gtgacgttcc tcaattctag 8700 
caacagtttc ctcactggaa atgtttgttt 8760 
gtacaatttt tgaagccgca catgatttgt 8820 
catgtttcgt agttgtttct gtagtaattt 8880 
cactgcggta gttgcatttt tgtgcatgca 8940 
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tcttcccata aaagcaacaa atgcaacaac tgatagagcc gccacacaaa ttgcaataat 9000 
tcgaagtcga tttctaattc ctttctttac tttttgtcta tgctcagctg ctttttcgat 9060 
gtgcttcttc ttgctggggt cgagctcgca atgaggaaat ggttcgatga gtggaccgtg 9120 
ttttttgcat tgttcacaac ggcgtccagt gtatttgtct gggcagtcac acgaaagagt 9180 
gcggtttttg aaatctgaaa attttaaatt taagaacagg atctatagca gttttgccca 9240 
tcacagtcct atgtctatat taaaaaaaat tatcggacat taaaaaaaat gttttctcat 9300 
tttttcagta tttctataaa aactgcattc gcatttaatc ataactttta atcgttaaaa 9360 
acttagtctt taagtacctg gggatccgta aacacagaca atttcatcac aataatcgcc 9420 
ttcaaatccc acatcacaga tgcatcttcc atttctcaaa aacccctcga cacatttact 9480 
tgtatattgg cattcacttc caaaatatga gcccacacat tcacatcggt cccccttcca 9540 
ttctcctttg tttccgcact gaaataattc aatagatttt ggaagtttag ggcctcaaaa 9600 
atataccttt tccgctggcc gatagtcaca catttcacct ttccatccga cttcgcaaat 9660 
gcacggctca ctgaatagaa ggctttccgg gtcgaagcgg aatccgaccg agagtccgtg 9720 
aactgtaaat tgaaaatttg taattccaaa aaaaaaacag cttttgcaaa aatcgtccaa 9780 
aagaatttta gagttagaca ttatttttct caaaaagttc aaagttgtat cagttttaaa 9840 
ataaaatatt taataggatt gtagagcttg ttagaaaaaa taaaagctac ttgaaaaaag 9900 
aaaggtatcc aaaaaggtat tgagatagtt tcaagcaact ctatttgtaa actgtcgagt 9960 
ttttaagttc tacaaatctc ttataacatc gctacatcta ctatcaaact ttgaaaaaaa 10020 
accataccac attcaaaatg ttcacattta tctccagtct gtcccttgat acaatgacaa 10080 
atccctccag catagattcc tccattacga cattcggctc tcggatcatc cagagcaaca 10140 
ttgtctagaa tacttctctt ttgaagaata cgatgcacgt cgctcaatat attttcatct 10200 
agatctagtg agtcatctcg tgattgtgct tttgttgttg ataaaaatag gaagagtaaa 10260 
gtggaaaatt gtaaacagta catagcgtta gatactgaca agtctactat caattgattt 10320 
atttattgcg tcttgaaagg ggtatcaatg agagaaatag ggagatgggt aaaatgcatt 10380 
tataagagaa tacaaaagat gacgtaattg attaatcaga gatcagttga aaatactttt 10440 
aagtatcaat tattatctgt gaagacagtc acgtgactct gactcgaact caatttgcat 10500 
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gttgatagtt ccaatgttaa agaaagtctt 
tggaatatta aacgtgactc ttctctgaca 
gatattaagt tttggatcaa tagtcatcac 
accaagagag tctgtagttc cttcggcaag 
cttttatttt gaaaaatttc aatgagttgc 
gttagcaatt ggtatttaaa tgttcaagct 
atataaaaac tttgtgtaag attcggtcta 
aaaaaacttt aatttcaagc ttcacataat 
tcaccttcgt gttagccagt ggtttgtctc 
attgggtgaa tacagcttcg gagccaattg 
cgagtcgtga cattattttt gtctgaaaat 
agaaactctc ttatagcaca gttggttaga 
gaattgacca aacaaaagaa ctggttttca 
acagagaaga tcggaagtga ttcatctgga 
ttttccactt atttatatac ccaatagaat 
tcgctgacta acgtgtgctt cttcggtttt 
agaacaaagc aatcataagg tcgaaaatcc 
tgtgcggcat ggcatagttt tatcttgctg 
gaatggattc catccgatcg ttgctccacg 
atacccgttg tgtatggctg gcaactgtga 
atacagttta aaataatttt ctaaactaaa 
catgtcgggc gcaggcacgc taacgtgaaa 
tttcaattaa aatacaatca actacacaat 
aagaatttaa aaaaaacatg gaaatttaaa 
tatatttcat tcgataagct tccaattcgg 
tctcatccat ctcttgttga gctgtcattc 



tgggttttct ccagatgaaa caaatgattt 10560 
aggtttgagt ccgtcatcac aatcgtgata 10620 
ttcggaagtg tgtccggtaa gaaggaattg 10680 
aagatcgtca agatccggtc ctgaaaaaaa 1074 0 
ttcatgttag aatttggaat ttttaaagat 10800 
aacgtaatta gagttattca aacaagcttt 10860 
attagaacat caatttttaa cgcagctgat 10920 
tctacttacc ggtatcatca tcgtagagct 10980 
cacacatcag aacaccctta actccagctg 11040 
cacaaagtat gaaaagtgat gaaatgcacg 11100 
acaaacactg actgatctga ccttcatcgg 11160 
aaaagatacg gagaggagaa gtgggaaatc 11220 
cttgaaatag aagacgatga aagatataca 11280 
gaagaaaatt gagaggagca acttcttgta 11340 
tcacctgatt ctttccgatt tgtgtacatt 11400 
gtcatttctt attgttcatt gaaaataaac 114 60 
catttagaga tcaagaggtg tacctttaat 11520 
aactctcacc aattgatgag tatgtcagta 11580 
gtgatctctt ccgccgcctt ttcatccacc 11640 
acagcgcctc agtggaatgt ttagtttgat 11700 
gaaatcagtt tttgaaacca gtcttgtagg 11760 
aatagaattt cgagtggtta actattttat 11820 
gaatgacccg gataaatgaa atacaaatac 11880 
cttttccatc atctcccttt gctggaatat 11940 
cttttctctg atcggatcgt acactgtgtc 12000 
tcttctcatt ccatttctga gcttttgctt 12060 
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ttttgtggat tctgttgtat tctttgcact 
gaattacagc aatgattcca gcgcagatgg 
cccagcagac attgtatttg acatgcttga 
gcatacatct tttcgttttt ccaccacacc 
aattttcaat atgtttgctt ataaaattga 
tcaacttctg aaattaggaa agtattctca 
agtttagaac tacaggtgtg aaaaaatctg 
gtttacgcgt caatttattc gagtgttcat 
attaagaaac ttgtttcaga catttttgct 
acgcaagttt ctttgaaaac aaaaacaaaa 
ccaacacatg tttcacaagt gttcaaatcg 
tctgaccatg tcatgttatc cgcacatact 
aatcggtgaa tcatcgttct gaaaaatcaa 
actgagcgac aaaaccaaag tagtaatgaa 
ctcgcgtgtg tttgtctatc gagtgccaat 
ggcagaagaa gaacatacga actatgtcgg 
aaagaaaatg catttgagaa aatgcaaatg 
gaatggtaaa aataaaggtg gatcagcaaa 
tgaaagataa caaaaaaaaa taaatacaga 
gtttttgtat agtgcaaatt cagtagttgt 
tttagagtgg gtcttgtcac gatccatatt 
aaataaagaa actaggcgcc aagaagctat 
tgaattacat taaattccaa caaaatagga 
ttcaaaaacg tttttaaaat tgttcataag 
ttcagctttt ctctctctct ctattctcta 
caaaagtgga agtgaatgag gatggaaata 



-54- 

tgcagcagca ataacagaag caggcaatga 12120 
caataacgat tgcggcggct gatgtgctca 12180 
tattacagtc tggatagtac cagtcgaatg 12240 
agaagcaatt ctgaaaaaat gtgtttttga 12300 
atttaatttt tcaaacagtg tttcagaaac 12360 
attgagagct gtttttgtat taaaagtttc 12420 
agcaagtgaa caccaacgta ttgcatcaca 124 80 
tgtagagaaa gttaggtcac cttccagaaa 12540 
cttttagagg aatttttttt tagaggaaac 12600 
tatatttttt atccacttac cgagcccttg 12660 
ttcgatccaa ttctacagta ttcttgtttc 12720 
gatactagaa caattgagaa aaagagtagt 12780 
taaatagtaa caacttgagc aagtctcgta 12840 
atagaaagat agaaaggtaa actcaaaggg 12900 
gagttttagg agtagcgaca gaaataagtt 12960 
gctacaagat tcttgtgttt actttttgaa 13020 
ttcggcagaa atcgaatgga gtttagagca 13080 
aatagttgaa caaatatttt gtagatttca 13140 
aaacaatata tgacgtattt ttcaatcatt 13200 
acctgttata agtacagcga agttatacat 13260 
ttttgaacgc aatatttgaa atccaaaaaa 13320 
agtagctata cgcataaatt gtgaatacct 13380 
aaatcatata aaaacgaagt tagttgtcaa 134 4 0 
cgccgagctg tccccctcag ttttcgttta 13500 
tcgtcaccta tatttcatag tccccttatc 13560 
tgataccgca tgcttcaaaa aaatttgctt 13620 
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atgagaaacc aacatttgaa aatttccagg 
gaaatgtggc agtgtgcgag ttgccggccg 
ggtatgtact gttgacacat tttacaaatg 
atgttgatgg caaggtttta aatagatgta 
acataaatta ggattaaaac aaaaatacta 
attccagaat ttgaaaatgt ttttctctgt 
aggctgaggt aggtaggctg tagaaagtgc 
aaaatcactt gtaagcacat gaaaatcaca 
tataaagaag atttacattt taataataat 
tcccaatgtc aaatgatgat gtggactgga 
aaagcagact gtgaatctct ttgccgagtt 
tgtacattgg aacgatcggc cggtccatgt 
tcaactcatc tcgattgtaa gccattcact 
ttcgttagca aagagcaatg tcagcagagc 
atctgcacac tccgcccaga gccgggaccg 
gacccggtga tccaatcctg tcatatgttc 
cggttcgatt cagagttgga ctgcttccga 
gaaagcgaga gagtgggaca gctgacgtct 
aaaacagcga tttttgttgg aaatactgta 
atatttaaac tggtcccatg agagtttggt 
taacttttca tgaacacttt agccgatttt 
tcaaaataaa aaccttacgc cactccactt 
tccgaatccg atgcaacagt tacggagtgc 
tcctccagtt cggctcgcga atcactgaag 
ctttaactgc tgacgccggt gtctacactt 
agggagtcga ggttgtgatc aagagacttc 
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aaacttgtga acgagcctgt ggtaaatgga 13680 

aacacggaga ttgccaactt gcgattccca 13740 

ggatgggaag tggtcggtga tcaggtggaa 13800 

gtaactgaaa acaaaatgac agatgtacat 13860 

tgcggagtca ggtgactaat ttttctggaa 13920 

ttgaaagtag aacgggacct tttacaaaat 13980 

ctttggtgtc tttgtaattt ttgttttcaa 14040 

tgaataatga tgtaaaattt agaaaattag 14100 

aattccagat ggtaccatga cccaaaaaca 14160 

tgcggaggaa atggaaacgc gttctcttca 14220 

gagacattat ggtccaacaa cactgacttc 14280 

acagattcta tttcaatgtg gtatttcgat 14340 

tatggaggtt gccgtggaaa tcagaatcga 14 4 00 

tgccgtcctg gagacacaaa atctgaggat 14 4 60 

tgtcggctgg gactcgagaa atacttttac 14520 

cattatggag gttgtgaggg aaatgcaaac 14580 

cgatgctcga gtgtcaaggt tgaagcaagt 14 640 

gcatccacgc cagttattta tattgttaac 14700 

agttattaat tttaattcga agatttctta 14760 

tcattttccg acaatagact gcaaaattga 14820 

agctagtttt gtttattaaa atttggtaat 14 880 

ttgaatactt .gtcaaataca ttttttcagt 14940 

ttccaataac atggtacaag aacggaggtc 15000 

agaatgatga cactttggaa attgtggatg 15060 

gcattgccgg ccaggatagt acaatgagcg 15120 

ctggtcacag aactacatct cgtccaatgc 15180 
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tgacaccatc caagaacttc tccttgggaa 
ctacaacacc cttccgaatc tatacgcctg 
gcccgacaag caattcctgt atggatgtgg 
agaacggttt gtgcgggaag aagcgatatg 
ttcataattt taaattttaa gtttggattt 
tgttttctcc ttcataatat ctccattgcg 
ttcccctcag acaattggct aattactcgt 
cttcttccta tactttggta tacataatca 
aaaaaacaat ccacttgaaa tccattcagg 
gtgccattac ttccacttat tttgcctatt 
gtctaactct gctgattgca agcttcccag 
tacgccttcc tagtgggttc acgtttcctg 
cgtatattcc agcgacggta aattttcgct 
attttctagg acaaaaattt aaagtaggct 
caacaggcta gcttttggag agaaatcaaa 
aagcgtccga aagtttttgg gaatccgcta 
aggtttattt tgcactatta aggcgtgtag 
agccttacaa tcaagtatta ttcatgaagg 
attacttata caaaaacatg aaaaatttca 
agctccacca cgctatttaa ctggagaaca 
atctgatttt ctaaatttaa cttcaagctc 
ctgagtacaa gcaagcgatg gccccacaac 
ctcctgcacc aatggtccca ccggttactg 
tgactacgat tgcatctgtt ggaccactta 
gaaagtttgt gattttagtt aattgatctt 
gaggaattgc tcaaaacttg aacgacaggt 



ccccaccgac accatctcca tctacagttt 15240 
gatctgctcc atctgatgct cgtgtaagcc 15300 
gtaacgcgag cacgtgcgat ttgatcgtga 15360 
gaacattctg ctgtcacact tgcacccggg 15420 
tttgatttca aattttcatt aatcttttaa 15480 
agatctcttt ttcccttctc ttcctatact 15540 
tcgttccagt aaataaatat gaatttattt 15600 
tggcatgaaa tacaagacaa aaaaaacaag 15660 
tgtgaactaa catcttactc tattaacttc 15720 
cactaatgaa gtctctgaga attattttct 15780 
ctcagcggag ccgccgaaaa cagaaatttg 15840 
cggatgcggc gagtaatttt caaagagatg 15900 
ttttgttaaa tgaatttcag gcttcaaatt 15960 
tgcgcatact catttccctg ccttacctgc 16020 
agtttggtgt ctgtaaatct aagctttccg 16080 
tacactttaa gattgataaa tatttgaatc 16140 
gcactaggcc ggcaaagctc gcctacgggg 16200 
tcttgatttg gttacagaat tccatctaaa 16260 
gtttgccccg ccatctgaga agattcttca 16320 
caatccagct tatggtaggc ccaatttttt 16380 
acaataccga tgtgcaagga atgaactacg 16440 
cacatccagt cgatgcttat tctccaccac 16500 
tagttgaacc acctgcaatg ccgtatgaaa 16560 
ctactcccgc atcagtcggc ttgaagaagg 16620 
tcaagtaatt ggatacaatt tccagcatcg 16680 
acaccagctt aacaccagaa gctcaacgtg 1674 0 
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ctcagaaagg tcatacctat acggctctgg gcggtggaca attctatcaa agtttacttg 16800 
gaggggtaag atgcaaggtt agaacttaca aactcaattc attttacaga aaggaggccc 16860 
cggaggattc tccccactct cgttctttct aaacggcggt ctaggaggta ctggtggtgg 16920 
tggtaacaat ggattcttcg ttccggtgcc tgtagtcatt ccgcctccac cgccaccgcc 16980 
accaggacca aactgtttca cgaacccgtc gggattcctt tgctgtaacg tgacacttga 17040 
gaaaactatg gaagacgcgt acctggccgc aaaagcagat ggtgcatcac tgtgcaatgt 17100 
acagaaaatg gcaactgcag tgcaagcggt ggggtttatg gatttcattt tataatgtaa 17160 
tgtgctcttc cctagaattg aataagctta caacttgaat tacgacttga attacaactt 17220 
gaataagctt aaaatatcca ccaaatttca gcaagccgaa aaaaaattcg gaacaacttt 17280 
cgaatcagtc gctgctcatt cggacttcgt cgcaaaaatt aattttgccg gtgacctgaa 17340 
ctgtaaaata gaaatcgatg ggaaattcat actagcgtac gcaactccaa tcgccgagca 17400 
agaggtgaac attgtcgatg ctagctcatt cttctcggga gctgctgata aggatttgga 17460 
tggtgtcaat ggtaccaagc ccacctacat tgtctacggt cccattaaat aatggagggt 17520 
ctagctttaa agatttctgt atattaaagc tgaaatgtga attaattgtt tatttgccaa 17580 
tcacaataaa gttggaaata tcatttgaat agttcgaaag ttttcaatcg gaatgggaga 17640 
aaattcgaaa atttaggtgg aggtgaaaag ttgatgaagt aacacaatta actgtgctcg 17700 
aatcctgaat agaaggagaa aagagcctat aaacagattt tcaatttaca catattacac 17760 
aacaattcag gaagaagaca gtagttgcaa aagaaaatac gtagaaaaaa gagtgaagga 17820 
ctggcgggat gtcagtttgg atgtacaaat agaactcctg aagcataaga aacagaagaa 17880 
tcgaccgatg atcgaacctg aaatggattt attgttgatt gaaaaatatt aagcaattct 17940 
gaatctctac cttgtttgat tgtgtgtaat gcaagaatct aaactcgtga gtgtgattgt 18000 
tactgatccg gaaatgttcg gctgcttgca gcattatcaa tatcggatta cgcccacaaa 18060 
tcgtgttctg ggtctttttg aggtagtcat taaaagctgc cggattaagc gtctcaattg 18120 
cgctcattcc ctgcttatcc atattggtta tctgctcata aatcggaata gaactatgac 18180 
gatcgtacgg agaaaagctg aagcgttctc cccaatggca aaagtccgaa gagatcacaa 18240 
acaagtttct tggatcctcc atgtaatgag caaaaatatt tccatacgtt tgctgcctag 18300 
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atcctggtaa agatccaaca agtaccggaa caatggtgta acgttttgaa cccataacct 18360 
ttgcaataaa tgggagttgc atttcaatac tatgctctga ttcttcatct cggcgatcca 18420 
tcaaatcgaa atgacgagtg gcacgaagct cctcgttaac tgcaaagggc aatgttgtaa 18480 
aagatgtact aagagtgcaa tagattactt ttgtgatcaa cgatcaagtc gccgagtgga 18540 
gttctgtact tgctgcatgt ggttatagca catccattta gagcaacaac gtgagatggg 18600 
ccaagaatga agactctttc actgaaagtt attgagtaag ccctgttgcc aagtacaaat 18660 
ttcaacaact cacactgctg atgaaacaac ttgtttgaaa gcatatgcag ctgtttctcc 18720 
acaatacgaa tatcccgcat gtctgaaagt tatcagaaaa taaatattaa atgcatttag 18780 
agtattacgg tgaaatcaac gctcgagccg ttccaatccg tggaccggcg ttgtcaagcc 18840 
attttgtgag ttgccgatca agatctcgct ggttggcgtt gtaccatgat ccggcatgtg 18900 
aggcagatot cgtgtgctcg ccgaatccgt ttagtgacat tttaaattca gatggtctga 18960 
atattaaagt tttgataaat tgttgtatac gacttgatta atatgtttag tagggttttc 19020 
aactactgtg tgtttcccaa atagtcaaca ttgaaaaatg gaaaagtttg aatttaaata 19080 
ttcaaataat tttaattaat taatattaaa attcacaata cagtgtaaca tcacacttaa 19140 
ttcaagatgt tctaaaaata tgagccatcg ggctagctct acttcacgaa ttcgaatcaa 19200 
gtccggggaa ctggctcgaa agaaaataaa tttttaattt ggtttatgtc cgaaatagaa 19260 
atgggaatct ggtttttcat tctgaataat ttccgagaaa cacttacaaa ataaaattca 19320 
gatatcttgc aaaaggaagg ccaaatgtcc tgagaaatag agcacgagag ttttgaaata 19380 
cctgcaacaa caggatttgc ttctattttg ttttttgaac tgaattttaa actattatct 19440 
attctgaaaa cattttttgt ccaaaaaaaa tcaagaacaa tttagagcaa aatgtggcaa 19500 
tccgaaaatg ttgatgcaac aaaaaagtgt tttttttttc attgaatttc agttttgaaa 19560 
actgatttct ttccaaaaaa' aaaacgaagg aaaattttga gaaaaaagtg aaaatccaaa 19620 
aatgctgatt ttggtttttt tttcaaaaaa aaagcatttt gcaaagtgtg tgcttttttt 19680 
cgaaagtttc agaaccttga gacaaaaaac caaaattgtg ttcccgagtg aagcccgcca 19740 
cgtggacatg gtcagacgaa tcttgttcgt gttcgcagcc aattttcatt tttgctgaac 19800 
gcataattgt tcaaagaaga ttcggtctaa aaagacgaaa ttgaaataga ttgtggaatc 19860 
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ctttgaaatt ttcttttgac aaaaggtcac cgttattcaa aaattgagat ggtctcgtga 19920 
ctaaaattaa acaatcaaga taatcatgat tgtgggcctg ttttaaaata cacttttcaa 19980 
aaacgaaatg taggctccaa tccaaactgc gcatcaagac caagaatata aaatttttaa 20040 
actcgggaga cgtagagaaa ctttgaatat taaacatcgc cgtcaagttt ccgtcagagc 20100 
gC gcctgaaa ttttttagag gcttctttca aaaagctacc catacaaata atcataagaa 20160 
aaacgtttta aaactttgca ttccacccaa aaatgtctga aattacccgt aaaaagaatg 20220 
tgtgaaggga gtgatttgag ggttctgtca aacagtttga ctgtttcgcg ttcgacgtgt 20280 
ctcgacgtgg atggtattga agaggaccgc gctgatcttg tgctggtcgt cgtcgtcttg 20340 
tcggaccgcc gcgagtagtc ttcagtctac caattacctg aaaatttgac aotttttgtg 20400 
atgtgaaact ggctgcctga agcaatgcca tataataatc ataataataa taatgaagag 20460 
ggatgaggat gcatgccaaa agaatgaaag gaaagacgct cttctacaac accagccgat 20520 
agtatttaga agaaaaagaa gactaaaaag agagtattgg gtgatgggag aaagaacaca 20580 
ataggggagg cagtgaaata gaacgagaac aatggaatcg gcagacattt gacactagag 20640 
gggccactgt ttcagtcttt ttcgcacttg aatattggaa gagggccaag aaggggagtt 20700 
ccaagaatgg aaaaagtggt aggtttgtag aaaatctgcc tttttttttt taaaatttcg 20760 
tgttcactac tttatttcgt gttcactcgt ttatgtcttc cattataggc aggcaaagtt 20820 
tcatgcctac atacctgcct catgcctatt tgactttcaa tataaaactt gatttttggc 20880 
attcttcatt ttataacaat tgtaactaat aataagcttt gcaaagtttt ctgaaagaaa 20940 
ttgtctaaat tttcctggta cactgaacat ttttcggtat aaaatctatg cgtatcaagc 21000 
ctatttctaa gagccgtaag tattttcagc tgaaaatgta aaccacggag tcaatattta 21060 
cttcgtatca tccatcttcc attccgtctt gtttacacct acggcaggta tttagacacg 21120 
aatgattgtt tttctcgttg cctaatactt tttcccccga aatattccca tattccagtt 21180 
ctgaacaatg cacttttcag cggtcatcgg gtccatccag ccctcattca gccctttcat 21240 
ttatcttcgt ttctactttt agacgaaaat gcaaaaaaaa gagaaaaaga cactctcttt 21300 
tgacgctcac attcgctcac attgctgtgg tagaaaaaca ctcactcggt ggctgctggg 21360 
aagggaaaac gagaaaatgt ttggtcacgc aatacgccta tatctttgat ttgactttga 21420 
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atctttatac atttttcacg gggttcaaaa acaattatga agaaaattgt ttgattaaat 21480 
tagaatgtag attctttata ttttcaatca aaaattaatt ttggaaaaat aactatccaa 21540 
aaaacgaaaa aagtaataaa tgagtacttg aaagtgaaat ggggcaatta aacaagataa 21600 
aaaagactaa aacgtgagac atctcacaac gggtcacggg caagaagtac acgagaaatc 21660 
gaacgtgagt ggggaggcag agacactcag ctgactgcct ggcctgacgc tcgctcacaa 21720 
aacgctctca ctctcttcct cgctttgccc gctctccgcc ccgggtcgtc agttcggtcg 21780 
atccatgttt gttcattttt ataggtgaaa atttatgtaa gggaacggaa aatgtaaagt 21840 
gatcgtggga aaatagaaaa acaattacat tgtaactttt ctggaccaag ttgtacccag 21900 
atgcaatatg tatatttttc tcagaaaata ctgtgttggg tttcgacagg atcgatttat 21960 
caaaagcaaa cgagtgtgcg tctcaacgag cactaaagtt cccaactaga gcatccttgt 22020 
tgtggtagaa ctacatagaa atttttaatt ttgatttcaa tagcttttct cttgttttct 22080 
caaaatttat tgaaaaactt atttactata aaacgaccaa cgacggatct ggaaactaca 22140 
gtactcctta atgcaaaagg caacgaaaaa tcagccagtg acttattttt tgttctggat 22200 
aaaaatcggg aatatttgca ttttgaattc gcactgtatc gataaacaaa acaccgaaga 22260 
tcacgccaaa atgactattg taactaacag gtacgagaaa gggacgcttg ttctacaaaa 22320 
ataattcaac aaattttccc caaaaaaatg tgaagtccgc aattctcgta gttttacgta 22380 
aatcaaaccg agcatgacac tctgacacca cgtgcgcctg aagatgtgcc tgcctaccat 22440 
ggatgcttta catttgctag ttccatgaca ccccatcctt tcagcttcca agatgaagga 22500 
gttcggagaa aattcgaaaa aatattgaga aaaataaccc aaaacattct gaaacattgc 22560 
ggaaaaaagt tagaaattat gtcgaatata tctgaaccaa tcaacaattt caaataaaat 22620 
acaaaaaaaa attggaagac cttaaatagt ctccgcccat attttggctt caaatgaccg 22680 
tacttcggaa tatggccgat ggccgtggca agacctccaa tcgtagtttt gagcggtcag 22740 
taagtgaaga ttaaaatagg aacagtaccg taagatcagc ccaggtgcgg atgtgggata 22800 
gaggaactga aaataatcga agaagcatga taactaagcc acgtggccac gttcgttttt 22860 
gcgatgttaa tagatcgcca cttcgtccat tgtcgttttg tttgtactaa gtctccttag 22920 
caattctctc gaaggcgggc cattgctatt agtaaaataa gctaccaatt ttacctttca 22980 
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atacattcat tcactgatgg ttttcctatc aggtgatcat ttttttgttc ttctcaatta 23040 
cactatctaa aaatgatgaa gtttttgctt cgcggctatt tggttgaagt gatgatatat 23100 
ccattgattg tcgtctccac ttgtgctctt tttacgtctt acaacttctt tttaagtgtt 23160 
ttgcgtattc actgtttcat ttattttttg cagaaaatga gcctgttcag caaatttttc 23220 
ggaggcatga tgcaagaagc tccgattact ccacaagaat ctattcaaaa acttcgggaa 23280 
acagaagata ttcttgagaa gaaacaagaa ttcttggaga aaaaaattga cgacgtaagt 23340 
tggaagatca gttttggtcg aattaatcac attaaaaagt gctgaaatcg aaatttttaa 23400 
actctcgagt ctcaagtgac tgtgacgtaa ttaaaacatt gctcagcatt tacattgttt 23460 
actgacgtct tttcgaagtt tagtcgagca aatccaaaaa agagcaataa aaatttctgc 23520 
tacgatacgt ttgggaaatt ggaatcatag ttttttaaac tccatttttc aaaaaataca 23580 
ttattagaaa atcagtaagt ttcggaaatt atttgagaaa cgtttcagga aagcaaaatg 23640 
ccgtgaagta tggaacaaaa aacaagcgga tggctctcca gtgtttgagt aggaagaaag 23700 
ctttcgagaa gcagttgatc catattgacg gagttttggc tactctcgaa catcaggttg 23760 
gtatataaaa atattagaga aataaattga ataacacggt ttttcttcca gagagaaacc 23820 
ctcgaaaatg cttcaacgaa tgctgaagtt ctcacggtta tgaaacttgc tagcgatgcg 23880 
ttgaaagcgg ttcataataa catggatagc gaccaagttc gtgatatgat ggataacata 23940 
g atgaacaac gagaagtggc gaaggaaatc gcggatgcta tttcaaaccc tggctttaac 24000 
aacgcaattg acgaggccga tttgctgcgc gagttggtgg atcttgaaca ggttcgtcta 24060 
taccaccaac atcgtgtaat tattagaaaa tataccagga agcacttgac aaagatttgc 24120 
ttgatgcgag agctccccca gtcacgcttc cggatactcc caatattgca cttccagcct 24180 
ccagaccgag agctaaagaa gctgacaagg atctagaaga cctcgaaagt tgggcaaact 24240 
aacttctcta agtcactttc atatttaatt ttcggctatt tttgtttcat ttgcatcccc 24300 
ttcatcaatc ctaccattct ccggagattc tcctaaatca actttctaat tacgacaaat 24360 
tcaaatagtt gaatgatttc tgtttagcca tttcattcga aacaaatttc cccaaggcta 24420 
cgatcaacac tcatcaaaat tgtaacatat tatcgagctt tttggaaatt tgtcatttta 24480 
tacatcttgg tccctttctc caaaatcttc caagcatgca ttaaagttcc aacttttatt 24540 
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aaaaattcat tctggcaaac atgttatttg taccggttga aaacgaaaac caagcgagaa 24600 
atagttacat ctcagatctc cctaacgatg gctcaacccc tttgacgctc atttactaat 24660 
gtttatactt ttgctcattt actaatgaat ggctcattta ctaacttgct gagatttttt 24720 
aatttactac tgctaattgt aagatatata tcatttatca tttactatat ataaagcgct 24780 
tattccgttt gtccatagtt tgtagtctat gtagtctttg tagtctgtga cgttttggct 24840 
tctggaagga tagtgagttg ggcttagtgt agggatatag ggggtactgt agtggtacaa 24900 
tagtggtacg gtaggagtac tgtatgatta cggtagtttc agaaaaatta gttttcagct 24960 
ccagaagtcg ggggccgcgc cggaggtgcg gtccacggct ggttttacat aaggtagttc 25020 
caaaaaatgt cctacttcca attactcata actcagttag cgcgctatag ctatagcgtt 25080 
tgagtttaaa aaaattgtgg ccaactgaaa tgctgtttgt cagagatgcg. agctctaaaa 25140 
gatgatcgaa atattctatt tctgcggatc tagaatattt cgatcatctt ttggagctga 25200 
catctccgca atcgctaaag ataactaaaa ggtaccaatt aacaaaatgt gttttacaat 25260 
attgccaaca acattttagg tttctttcgc tgattgttt* cttttggttt tggtgatggt 25320 
cccggagtgg tttttttcgc tggttctact attttttgga tcggcaggct ctgaacaatt 25380 
ggtt gtacaa tcttcttcaa cttcatcaaa ttatccagag ttatgttgtc gcttctgctg 25440 
tccaacatat tcatgcattt gacggaactc ttcaactttc tgcattgtca ctggattctt 25500 
ctttttcgat tttattttat gaaaacttta ctatcataaa caatagtatt tatcatgtta 25560 
caaatcagtt tggaatgatc tccttcattc aaaattctta atgatcagtc gattcactct 25620 
tagagccacg aaaaatgtgg gacaattgtt tgagaagtga aaaatagtta ttaatgttgc 25680 
aattagttgt acatataagt aatacatgaa aatacatctt aaaaatacag ttactactag 25740 
gtattattgc ttaaaattgt gttccaatct gccagtacta tgagcgtaat tcgttgatcc 25800 
aatcttcgaa tagccgtgag cacaggcttc gccggcactg cacacaaact tcacgattgc 25860 
acgatttgca gaggtagagg acgaacgact ttcctgtaat tggcgaaata ttgttttaag 25920 
ataaagttag taggaacgat cgtactgttt ttagaacgag actgtctagc tggtggccgc 25980 
atcgagcatt gatggcatcc aagaccttga acttcttcgc tgaatgatat acgatgcttg 26040 
aatatggatc cactgaaaat tgaggttata gtagattatt gggagctatt atgatttcac 26100 
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cc.t g ,. g ». ct g c„tc. g t »ctct«c a g .ttctc g c t.tccttttc ,cc g cttttt 26160 
Ctt,t M « ctat,.,... acta^at tt gg t g ,=,t tt g tc g ,„tt ...cattcc 26220 
ac g a gg c. g . ca,.catct g a,attt g c g t tttttccac. aat g =ataaa cttto.ta, 26280 
a,ca..cc g c ttct. ggg ca .catca g cta a.ct g t g atc ,t g ctc,t.t tc gg c g ttt, 26340 
gcg , g ., g c. t.a.t gg ta g aat.a.t,.a a g atatc gg t « gg ttc g c gg g ..tcc gg ,t 26,00 
t g t. g tcttt g a g ,ta»tca .c g caatttt ,tttc. g .tt c g tc.tc. gg t.tt tg tc g c 26,60 
.ta g cct g a g aact g t g c.c ac,tttt g tt ct g ..a.taa attt gg c.tt catt g .a,ct 26620 
ac.tc.tca t g a.ctacc, tc..t..ct ccat.t.a acca. g . g .a tt ggg . g ... 26580 
t g .c. g t g .t c..ctt g . g . at.tcttcc, g t g .ct=.tc a. gg .t.ttc a g ct,cttta 26640 
t gg c g ccttc .. gg .a.a.c tt g tt=tcc. tc.,.t g c g ,.a.t..tcc ,..tttctt g 26700 
c,.. g a S t g c t gg .tcc.c aa g t.ctttt g .ttaccaac aata.ta Sg c c. g tttc g .. 26760 
g ctt g ctt g t c.ctc.aaa tcaacct,., . g .....t=9 aaaaattcca attt...... 26820 

c g ttt g ttt. c g t.atc gg . tccttct. gg a. gg t«cat g .ctt g tt g t c gg ct g catt 26880 
a g ..t g .c,t tt.c 9ggg .. atcattattt .ttcc g a.ac g t,ctt ggg e tt g ttc tg tt 26,40 
t g ct g a»att tt,... gg tt ctccaatat t.. g c,a... ...ctt.c.t t..t..tata 27000 

, gg tctct. g c,cc g . g t. g cta.acaat ta.t.ttt,. ttacaa g ttt gg aa. g .tct 27060 
ttct g a,ctc g atc. gg a. g ...aacttct t g .aacttt. ,.a g at g aa, t g t g t g ct.c 27720 
cg tataaact tt.aa g ,t g e atg-t.aat ttctcctttt gg tcct g c g . c g ,tt...ct 27X80 
ttttaatcaa ttctct ggg c t. g ttttt.t tca.ta.cta g aaat g tt g t tt.ttttt g t 27240 
tccctactta aatc.tat g t tattttcttt ttccttt g t g tcttaca gg c tttttta g ct 27300 
g a. g .,.t„ c,attttcc g ataaaattt, tt g ctctat g tt.a. gg c g c ,t g catttat 27360 
tt g . g a g »c g gg tctc,ca. c g t 5 ctc.ct cctc gg ccc g attt g ttctt c g ttt g c g c g 27420 
g ttttc. gg c cttt.aa.oa t. g ttcc g tc g tttttttct c.atttct g c t g .a.t,. gg 27480 
ttt.att.aa ttt.ttttc, a„tctt gg t aaacattta, actcatatat tc g ..tttt 2,5,0 
cttcctctt tcccco.. ..ccaattt ctatta, g att,a g .aca c.tct. g ..c 27600 

, tg c....a. c.ctt g « .tctctct.c tttcatttta , gg ct,.ttt ttt g .. g ... 27660 
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aatcatgaaa tacgtccatt attgttgtat cccttgtttg catccaaagt tgactcgatt 27720 
gatctcttaa atgtggtatt ccgttcgaaa ttcgattgat ttttagaagt taacacattc 27780 
ggaatgatga taattcgtat caaaccaaaa ttgtcttctt ttcgcctttt ttgtgcagtg 27840 
tcagcattaa acaaaacgag aatattgaaa gttacgtggc gtttgcatct ctcaccacga 27900 
tgacatcacg aaatgcagac gacaaagacc ggtgaaaaat agtgcgctga atggtgaaaa 27960 
cttgcgaaga taacgtgtta cgggttgaga gagaaaacat tccgcgagac aatgcttttg 28020 
gtgagaggcg cagatggttc agagaacact agagaaaacc gcgcctctgt ccgctcacag 28080 
ccagccccat caagcctctt cgggcatcga cgcatagaca cacatcattt tgccccaatt 28140 
tcctttcatt ccgtcaagta tttcgcaact aatcgttatt gctcattaca atacacattt 28200 
tacagaagtt cctcttcttc tacttggtcc gaccgcatca gataactggg agatccagtt 28260 
gtgcatgttc ttgtgcccac acaaactcgc gcccatttac aattttatga tcgacaaccc 28320 
tcaagaaggt aagcatttaa acgtgttggc cgtgcgtctc aaaaaattgt taaaaaacct 28380 
ggcgacacgc gtttttccac aatttcattc cctagggcat tttgtatttg aagtaattct 28440 
attacgcgta cgcaatcgga cgaatcctgc aggtttgttg gtagtcaatt ttatcaagtc 28500 
gactgcctct tatgctttct gaaaaaagag aatgacagtt ttcgctaagt agtactaaag 28560 
cgatctttta tctttggcaa aaccttgata taagcattat cacagcatat catgcagatt 28620 
gatttagagt taagcatgaa atgtgcaagg ctaaaataaa ttacaaaata agtccatagt 28680 
ccattttagt aacagtatac atcagctgat agaatcacat gcgtaatgac aggtctaaaa 28740 
cattatcaaa caaaagacat tacaaaaaca agaaaaatac aatataatag aacgactatt 28800 
tgaaatgagc gtagttaaat tcggaacttc aatagattat catacgcgct tttaaaaaaa 28860 
tgtgtgttcc cttttctccg cgtttgcccg ctacaaaccg gtgagtcgga aggcataatc 28920 
gggttgaaaa aaaagtatca aacactgatg gtgtcttttt tagggaggtt gtccagaaag 28980 
agaaagaaac tgaagatttg cgaatcgata gcgtcgtcat ctctcgacgc cagtgaagtc 29040 
aagatcggtt acaatagtgt atgcgattcc caaaatccac atatcaaccg gactcgtgat 29100 
atttatcatt tgtaagtact aacaagagat gtgaacgtat ttacactcaa cattagcaaa 29160 
ttccagaaga agatctaaac aaaaactatc gaaatggctc tcaacgtgaa ccgcgctgtc 29220 
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gctgatccat tctaccgcta caagatgccc aagctgtcag caaaagtcga aggcaaagga 29280 
aacggaatca aaacggtcat ttccaacatg tctgagatcg cgaaagctct cgagcgtccg 29340 
ccgatgtgta tgtttatcgc cagttggctc gccattggac acaaaaataa ccattgtttt 29400 
tcagacccca cgaagtactt tggctgtgag ctcggggctc aaacgaactt cgatgccaag 29460 
aacgagcgtt acattgtcaa cggcgagcat gatgccaaca agctccagga tattttagat 29520 
ggtttcatta aaaagtttgt gctttgcaaa tcatgtgaaa acccggaaac tcagttggta 29580 
cgagatcatt gaattaataa tctgtctaat tttattattt cagtttgtcc gtaaaaataa 29640 
catcaagagc aagtgcaagg catgtggatg ttcgttcgac attgatctca aacataagct 29700 
gtctacattc atcatgaaga atcctccaaa gattgatgtc gatttttgta agtatcgttt 29760 
actaacattt ttcgattgaa cttatgcaaa attctgccaa aaattctatt tgcattttaa 29820 
atcctttcaa ttcgattttc cgtgtgcttc cagtgcatac aaacatgcta atttttggtt 29880 
tccagccaaa gccgaacaaa agaatggaaa gaagacatcg ggtgctgacg ccgccgccgc 29940 
cgtggctgcc gacataatcc acaacagcga caaaggcagt tcgaatgatg acgacgacga 30000 
cgattgggaa cctgaaccag tcgagccgaa tggcatgctg tcggcgggaa tgggcaagct 30060 
cgtgctggac aaggatcttg agaagagcga agaacagcgt ctcgacatgc ttcacacatt 30120 
cttcttgaaa gccaaggaag aaggtaagaa ttctgagcat tgataaaaag tattctcgtt 30180 
atttcagata gaatttctga tgccaaggga caaactgctc tacgtgacga agctgagaga 30240 
cttgagctga agcaaaaagc atctctcctt ctcgcgaacg tttttcttga tgagaaagta 30300 
atcactgaca aacaaatcag caaacaccgc aatcttctgc ttcgcttcac gttgaatgac 30360 
aagaaagctc aaagatacct gttgggagga gttgagcaag taattcacaa acatgaagcg 30420 
gaacttctgt ctaaatcagc tcacatcatt aagtcattgt atgatgaaga tgtctgcgaa 30480 
gaggattcgc ttatttcatg gggagagaag gttagtacca aatggagctt tgtttcgaat 30540 
taaagtttat atttacagcc gtcgagtaag tatgtctcca aatcttttgc caagaagatt 30600 
attgagaact ctcaaccagt gctcaactgg ctgaaagaag cggaagaaga aaccgaagaa 30660 
gagtccgacg atgagattgc ggtaagaaat atcagatttg tttttttttt ttcaatggtt 30720 
ggttttcagt tcggaggaga cgtcaaggag agtgaattcc ttcgtcaaca gaaggagaag 30780 
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.ctgctagag aa g ctca g ca aaaatcagcc aa gg ctacaa ac gg caat g c t g ct g ct g ca 30840 
tcc gg a g caa at g at g aa g a gg actt gg at att g at g aca tttaattgta cagatgcttt 30900 
tttaaaattt acct ggg cta cttatgtttt ttgtgtattt cttcccatat tcgaaccaat 30960 
tcaactaatt tcgaagaagc ctcagttttt ttttgctttc tccccctttc aatagtaagc 31020 
atcatttcat ttctgtcttc tgtcttttct gttcctacgc tgttttccct tcaccaaatc 31080 
caattcattt attcgtaaag tcattactat ttgttgttaa tcgtaaacat ttgggaatat 31140 
tcttgttcaa ttcagtctta tattacaaaa acacaatgtt caaaaaaaaa g aatcacttc 31200 
ag at ggg aac cc g tc g aatt c gg c gg tcc g at gg a g aata cacattgttt tttc gg aaa g 31260 
ttagcccatt ttcaaatcat cacccagctg atttcatttg cgacgaagcg ataaattgta 31320 
aa g a g cc g aa aaccttttgc t g ctc gg aac a.tactatat g tacaataa g gcttcactat 31380 
tgatggattc aaaact g at g g ca g c g attc tagaagcaac tt g tcc g aaa acaatgaaga 31440 
caat gtgt tc taaat gg tc g tt g aaa gg at gg aa g gattc g g t g taagtt ttaaatcagt 31500 
ttgataataa aatatgtttt tcttttacag atgggatgag aacaaagaag aagtgatgag 31560 
aata gg atgc ttggcaaaat tccgtgcttc tcgccatctt cgttatgctc tttttctcac 31620 
aactggtagc aaactagtcg aat g ta g tcc g ttc g ataaa atat gggg aa tcggttagtt 31680 
tccaacggat cgtcttattc ttccatcgcc catcacaatg caatcagaat cttcaaactg 31740 
gaaatgtttt gaaatcattg aaatcatctt tgagctgata t gg t g ac gg a agaaaaggac 31800 
g tct g aaaat gg ct g aatta ttataggaaa agatatgcaa gccgcacaat ggg ctccatt 31860 
gag ctct gg c aacaatctgc t ggg aaa g at ttt gg at gg a .tccgagagg aattgtggga 31920 
tga ttcaaat tacaa g tta g ctct g gaatc agaaaattat tattatataa aattactatt 31980 
tcagagatga acgagaagaa gt gg a g aaac g aat gg aaac tgaaagagat tatctattca 32040 
ctgctataga gcacatggac ttgatgtaca aagaaagagc aacaaaaaga g tatt g taa g 32100 
aatcagaaaa tct g c g taat t g tc g aca g a aataacgtat tccagattgt tc g a gg aa g a 32160 
attgttaact g at g ata g at cctacatcac accagatatt ca g a gg ctcc ttcccgactg 32220 
ggcttggccg ccgatcctcg tgaaaaacga gcctattcaa ccatcgctgc ct g taataat 32280 
cgatttccct aggtacttgc cttgatcttt aatttatcag aattaacttt caaattccag 32340 
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atcatctcca cttcgagcag ctgaaatatc acgtaggaag agcacatctc attcgacaag 32400 
cttgagtaaa aggcggtacc tcaggagcag atcgagaagt ctgtccaaaa gcccggctcg 324 60 
aagacgctcc agacatcttt cccgaagtgg atcccgtaca ccagctcaac ggcattccag 32520 
aagatccgaa agtacatctc gaagacgttc cggacggcac tctagaagtc gatctagaag 32580 
cccaccacga aaacgtccgg tacgccgatc aagaagcaga tccaggagca ggacaccaaa 32640 
ccgaaattgg acaagagcac ggagcagaac aagaagtcag gctaaaagta gcagcacttt 32700 
aacctggcca ctgagcccat cgagaagcag aagtaacagt aatgaaagga atttgaaaga 32760 
gaagaaagac cggaaaaaga aaaaatctga gaagaaacgg aagcatcatt ctaaatccag 32820 
aaaacaccgt tctaaaagat ccgaatccag agaagaacgt cacagaagac ggaaggagaa 32880 
gaaaagagag aaaaagaaga aacgacgtcg gagaagttcc actacttcag attaaacttt 32940 
atttttgaaa actagtcata actttaaaag tcataacttt tttaaaagtc ataacactgg 33000 
tttaatatca aatgtctttt caaatattct ctatttattt attcttcgta attaaactga 33060 
gattaagtac tgggtatatc attaataaaa ttacgatact ttgccgaata aatcagttat 33120 
aattacaatc tgtctgctgg tgaaaattgt acatgctatt ttcttgttcc tcattctttt 33180 
ttcattctct gtaaggtttt gttcgttttt tggaaaattc tgagagtagc cggaaaaaaa 33240 
aaaaaaaaaa actaaatacc tacagtaatg ccagaggcat atgctcaata attatcaaaa 33300 
attagttttc cgcggcgaga cccatcccca caaaagtatg actcccttga aagtcgtaaa 33360 
tgacaatttc ttgaaacaag aacatttgta tattaacgaa acacaaaatt ccgagaatgc 33420 
gtattgagca gcatatttgc cgagccaaat atctcgtagc gaaaactaca ttaattctta 33480 
aaaacactac tgtagcgctt gtgtcgattt acgggctctt tgaattatca ttgatttatc 33540 
gatagaatat ttaaaaaata aattcatttc gaaattagag cccataaatc gacacaaaca 33600 
ctacagtagc catttaaaga attactgtag ttttcgctat gagatatttt gcgcatcaaa 33660 
tatgttgcgc aatacgcatt ctcagaattg tgtcttccgt aataatagac agtggcttcg 33720 
ctaaaaacta agaacaaagt aaattaaagt ttttttctgt tcacttcaaa ttttacacga 33780 
tcttgaagca aagttcaaaa gagcatgaat caattggaaa gtgttcaatg caccctacag 33840 
atatgatttc ggggcagtgt aaactacagg gcacagacat aaaaatttaa attgttgaag 33900 
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actaaaatat aaacatatga attcaagggt cataataaat gtattttttt aaataatatt 33960 
tattaaatgt atgcatacaa ttaaatacaa cataattatc aaatacaaat attataattg 34020 
caacctgtcg gacaacaact ttgctgaggt gtcgtgtgac agtcagaatc cttgtcacac 34080 
cagctgaccg gctcagagac gatacatcgg aagttgagat gagtgactgg tggacattgc 34140 
cgacgcgttg gagcacaaca ctcacgatat cgagtcatgt cgatgcagcg ctgaaactca 34200 
ggaaactatg tggaatttag gtggatcacc caaccagctg cccttcaccg cactgataat 34260 
ttggagtgca gtacatgtaa tgggcagagc attgctgcat ttgcatcaca atcaatgaat 34320 
ttgcaaaggg cctggagatt ggcttggctg aaagagttga tattatttct attgatataa 34380 
taccctaaat ttacgaaaat tatgctaaat taggatttta gttataatcc tcgtcacatc 34440 
tgatctctga aaacttaaaa atatcctttt tggtagtgtg gcaccaaatt cgtgctgtaa 34500 
cagagaccaa aaacactact ttttcgacat ttcctctcct tgcagcgaaa aataaaattt 34560 
tttgaaaatc tgtgttttct catacccgga aaaaaccaac aaaaacggcc ttgttccaaa 34 620 
ggcggtgagt atttctattt tatgaaagtg gccgagattt ctctttttct acgccaagta 34 680 
gttaattctt cgcggcaaga cccatcaatt ttctaacctc taatctcttt ttcaacatga 34740 
atatccacgt catcatagaa tttgcactcg ggcttataga tttggagcct ttgaaagtat 34800 
atgcaccagt ctatatgggt gttgggaaac gaataggcag tagttttttg gaccaattgt 34860 
agaatagaca gtagtaatag ggaagaatat aagaatttca taattcagat ttcaataaaa 34920 
aataaattta attgagaaaa aaaacggttg atattctttt gtttaagcag acaagtatgc 34980 
ggaagtgaat cttgagcacc tcgtaaatca cgggaggcgt acttgtacag aagagagata 35040 
agggattaag aggcgcaagc tttgccactt tgaagttaaa aaataaagaa agagacatgc 35100 
aaattggtgg acaaatagcg gaaggttagc gggaggtggg aggggggaca ggtgcatgta 35160 
acacaatgga ttttacaata ggaatattga aaatacgcat atgggaaatc ggaacagata 35220 
tgaaggtgtc aatatttgag gtcaactgtc tggtttttcc ccgatttttg aattttttga 35280 
aaaaaagtgc ataattcaca gattgaaatt ggaaattggt cgagaaaaga ataaggagtg 35340 
ttatgaattg atggtggcaa caaaacacaa attctacatt tgtaccaaaa tgcccactaa 354 00 
aatgggcata ttcgcacaca ttccacacaa attgcataca tattccacaa tggggaatat 35460 
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tttgaatatt tagattaata aagatgaaat aattgagttt tatttgtaat taaaatattt 35520 
ttctgtttat cattaattga aaatgttgaa ttacttttta atagacgaat catcaaagaa 35580 
cttgatccct gcattatcag gcaatcctac ataacctttc aacgttgtcg ttttaccaat 3564 0 
tgcaacattt ctcgctactg gaacacgcat actggaatac gatgacgatt ccaattggaa 35700 
gaatatattg gtgcccggtt ggaagttaac aattgaattg ttgttaagcg ataaaggata 35760 
cacattgata acatccaaaa gttcagttat gtatatccat ccgtataaat cttgcgatct 35820 
tccattcacc aaaagctggt cgccatcttg tataggaatg aatggagtta aggatcccgt 35880 
aacagtacga gttgtgagcg tagttccact gaaaattact aaatatttag ttcaaaggtt 3594 0 
ttctgttact actttttggt tgcaacaact ctgagaaatt ttagttttca ccaaaatttt 36000 
tcgattttgt acagaattgc acaatatatt ttggaatagc aagaaattgt tcagtgaatg 36060 
tcaaatctga caaaaaaaaa tttttttaaa aggtgcctat caatttttaa aaatgttcta 36120 
atattttgtt ggaaagtttc aataatttca ctacatttac tatttctttt ttaggcctat 36180 
tttgggtatt caaaatatta accacacgac cttcaataca ggaaaactgt caaatttttt 36240 
ttaaattatg aacaattaac tcactttaca ttttgtcctc cattccttgt agttaatata 36300 
agacttccca acgcttcttg agaactattc gaaataatat aaatcttcga atttcttcct 36360 
actatatatc ctagtgtgtt gctcgttgca acgtctagag tatccaatat aaacccacta 36420 
gaagctgata taaagaaaaa taatagaaat atatttttca ttttttccaa atgactaaat 36480 
gaccaacttc aagacatttt atatgcttaa aatcacgtca cagaactata atcatgttga 36540 
tttttgatag aaaatgataa gaaatgcgac caaaatgtgt attttctccg tttgtcctct 36600 
gaatgagtca aattcacgta aaacttggca tttgtcacag tgtgtcagac acaaggcaca 36660 
tatgtattta ccggactttt caagacttta ttattattga gatcaaacca gattacagaa 36720 
gacgggagaa aggtaccaac aaatatcaga atattgcaaa aaaaaattaa aaatttcaaa 36780 
acgcaaactt caaactagga gagctaattc aaactttgaa atcatgttcc ataaccggta 36840 
gcatttgttc ggtgacttgt ttgacagccc attgaaggaa gagaagtact cccgacaggc 36900 
tgaaacatat gaaatagtcc aggccttcca ttagagaatg tgatgtttga aggaagaaca 36960 
atgggacgta gagtactccg aatagagcag taagtccatt gatgagctga aacagtaaat 37020 
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aatcgaaaag ttagtaaata tgttcaagga 
ggcgttttat agttttttct ctttttttga 
ggttttttcg agttccagaa aaaatattta 
cgaaggctac tccaaggtaa gttaacccta 
cggggcgcga ctagatcacg ggttcgcgct 
cgccacgtcc gcgcttcagg aggagcgatt 
tacttaaaaa ttaatcgatt tttttaaaaa 
gacacaatgt tgaaccgtac tagattgttg 
aaaaaatttg aataattttt ttctaacttt 
aataaccatg aagcacaact tcataagtgg 
ctgaaaacaa aagagctgat atggagcaag 
tcacactatc cccttcgatt ttgacgcggt 
aatggattgc gattcaaaag aaaacgtagc 
caaaatgtat ttagaaaaaa ctcacgctaa 
catgattact gtagatgcat agagtgagaa 
aacgacaatc gaattggccg ccatcatctc 
atatatgaca ctttttaaat gaattgacag 
aggagtggtt caaatgattc ataaagggaa 
aaaaaaatat tggtgaatgt ataatttttt 
cgaagattag caagagtttg gggataaccc 
tgaaagaatt aaaaaaatcc tttcaaattt 
cgaacattga gcagaagaag cttttaccaa 
ttaaggatat cacccagttc aaaatagtag 
attcgaattt ttacagtaca ttgatcgttt 
aaatagataa tgaagctgcc aagtttaaaa 
cggtatatca ctacctggat attagtttta 



atggaagtaa accggaatta tccgagtatg 37080 
cttcgttttt catcctatta aaatatcatc 37140 
aaaaatcatc cgaaatccga acacaaaatc 37200 
ctcggcaaat ctctcgtcct ggagcgcgga 37260 
ccagtcaccc tttttttcgc gcttcttacg 37320 
tgcggagtac cttttatgca ttcagactgg 37380 
gtgtcataaa ctttttctac gtctttttct 37440 
taaacacggt cttcaaattt gattttcgcg 37500 
tttcttttta aaatcttacc acacttagca 37560 
atcctatttt tcgtttgaag aggcaaaata 37 620 
acacgtggat ccagaagagt atacgcacaa 37 680 
acagaattct ggaatttttt tttgaacttt 37740 
ttaatctcca gttaaagctg attttcattg 37800 
taaggcggag agtattgtct gtagaaccgc 378 60 
tgagcacata taagcgctcg gctgtttttg 37920 
attcttcgac ctcccgtttt atttctgaaa 37980 
aaatctgatg ctaactacat tttaacttgt 3804 0 
tacaatttct gaatgatcaa agaagaaaga 38100 
aggggtaaag taaataaata aacacaaggc 38160 
ccgtgaagaa aaatatgaaa aaaaatggtt 38220 
gagattcaaa ttttgttcat ctgttctgtt 38280 
taaatccaaa atttgttaag agaatatagt 38340 
ttcaaaaact cgagtcttaa ttttttcagt 38400 
cgttatttga tcgctttttg ataaaacaaa 38460 
aaatcggggc taaggctaat ggagcataca 38520 
gacttcatca gatatttagt cagaaaagta 38580 
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cgtcaagaag tcggatacga aatgtataaa tttcttaaaa cttaaaactt cgagatatcc 38640 
agactgtggc tctcaagctt cagtgcttgg agaaatagtt taatagtcag aatatgtttt 38700 
aaatttctta atttttctga agaagtcgta aaagtataaa tgttgctaga tcaaacactc 38760 
tagaaaacct tcaccacttg agaatactcc agtctcaaat tttccctcga cgcggaagtg 38820 
tagaagggcg cgagattcag aagtaggtga aaattagacg gaaaactctc tcaaaattga 38880 
aatcaatgaa taggacaact gagacaatgt gcaggtgtat gtgtatgcac atggcaccca 38940 
cgtacacgca tacatcttat gttagagaag tacgtgtgct ccgctcatca tgtcttctcc 39000 
ttctcctaca tctacatttt ttgctccgtg agccacgccg ggaaaaacga cgacgacgac 39060 
ggcgacgggg gacgactact cgactctaat tggccctaaa cgcaagtaaa tttttaggca 39120 
atgtatgttt gcgagagttg agagccccac cgccacgagg agaagtgggg gaagattccg 39180 
aagagattcc ccctcctcct tctgatcacc tcgtctttcc ttttttgttc catttccgtg 39240 
aaaaagctgt ggaagggagg agaagaactt accggctaaa tggaaaaaaa ggaactctaa 39300 
cttattctga ctctacggaa ataggaagcc tacttgtcaa ttagaccgcc ctcgcacaga 39360 
tttctttttt tttgtagata caaatataaa aactaactgc gtgtgatgca gcagatatct 39420 
tgaattggaa agtgtcagtg ctcagaggga atagccaatc attgacagaa atttgactac 39480 
ttcagaagga atcaactaga acatttgacg cctgaaacct aacaagaaaa atctataatt 39540 
tggagatccc tagattgatg ccaactttat taaaaactaa gtatacttat atatatacga 39600 
tttttttaaa aataaacctg attgtctgaa tttctacaag attgcgacca aattttccgt 39660 
atttccaaaa tctaatatta ggggtttcta ctaaaattca acgagaactc ttaacattat 39720 
ggttatttta acacatggtt caccgccggc tcaaacttca ttcttagtcc tctgattttt 39780 
ggtaaatcga cgcctacgtc tcaacaatta gtttgtgcag aaaataagta aaaagagttg 39840 
tgctccatct tgcacacata cacatcgcct gtaatgaaga ggttcggagt cagatgacta 39900 
ggcgtagaaa tgtgcgaaat tcacggataa cagagatttt tgatgtttca tcagacttac 39960 
acgttttgga agtatgaatt gggtctagac aacggagtgg cagatgttcg gaaaattttg 40020 
cagaaaagag aacctaagag cgttgatggt ttggtgacta acgaacttaa aagaaaattg 40080 
gtcattgaaa attttaaaat tttaaatttt gcttgcagtt catctttctc tattaacaaa 40140 
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aattattttg tagcttttct caatttcagg caattaaaac atttcaattt attcttctat 40200 
tatggaagtt tatctctaat tgaaactctc caattttgat caaagaacaa acgttctcgt 40260 
tgtttgaaaa aaaaaacagt tcttttttga aactcgcgcg caaattatta accaatcatc 40320 
ctcgtttgcg cgcaaaattg tagaaaaaat catttaaatt tatcaaaaat agtttaccat 40380 
tctgatgagt ttttcatata caaaaatgcc ctggcaattg ttgttttctc tgaaatagca 40440 
cataataatt gaactctacc cacataaagt tcgttctgaa aaacacctta caattattgt 40500 
gattgagagc caccccaaga gggattagaa aaacggatgt aatctgtata ccttcgagat 40560 
tcgtttattt ccttgtataa ccaatagcag gaaaattaca gctttttcta agtaagcggt 40620 
gaaactagag agattctata gaatatgggc gttaataatt gtatgttaaa gttttagaat 40680 
aacacaagtc cagagtaagg gcaagaaaag taatgagcaa cggaaaccag catgcaagac 40740 
acccgaattc cggttctctt ctgaaactaa aagttgcgtg tactaaacct taaaccagca 40800 
gctggctagt ctcaagaaat aatagaaaaa aggaaggaat gaagatatgg gaataataca 40860 
aattgaaaat gttgtgtgag ctccgaataa ttttcaatat caaaaattta tgaattgtgt 40920 
ggacggctgt gtgtgcgtgt gcgtatgcgt cggcaagaaa aagaagcgac cgaataagaa 40980 
aatggttgat tcagtgaaca aaaaaagaga gaaagatatc caaacaaaat tattcaaaac 41040 
tattatcaat cggtaggtat tgctctagag cacacctttc tggacactca gcagacatgc 41100 
gtagagaggg attatgtggt acatatagtg gatggaggaa cagatattta taaatactta 41160 
tggaaaagag gatgaagata ggatgaggta gatgaattga gaagatttta aaatgataat 41220 
ggatattgaa tttgaataag gagattctaa attatccgaa gaacacaaac tatatcaaga 41280 
ctacaaaata atctagacga gtcccagttt tgcaaggtaa ggattaatct taaaaggatc 41340 
ttttaaatat ttatttcaat gctcctataa attttaaaaa gtaggtgcat tctaatatgt 41400 
acagtgatta ggagatatgt gacgttacgt gaggtctcga taaagtacgg tattcgagct 414 60 
aaatttcaaa cattgtcaag gtagattcgg tacacagcca ccataaatgt tccactaaaa 41520 
atgtgttgtc cttctccttt ggaacacaaa tctagctgct gaactttttc acttcactac 41580 
atgtcaatgg gattgatatg catctaggac atttttttgg ttatcaatag tccgcatagc 41640 
ttgcgtaacc aatacaaccg attgtccaaa aaaatttgaa cactacaaaa cgtatttatt 41700 
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42360 
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t 
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<210> 24 
<211> 4434 
<212> DNA 
<213> Unknown 

<220> 

<223> Description of Unknown Organism: Unknown 
<400> 24 

gccgccctcg ccaccgctcc cggccgccgc gctccggtac acacaggatc cctgctgggc 60 
accaacagct ccaccatggg gctggcctgg ggactaggcg tcctgttcct gatgcatgtg 120 
tgtggcacca accgcattcc agagtctggc ggagacaaca gcgtgtttga catctttgaa 180 
ctcaccgggg ccgcccgcaa ggggtctggg cgccgactgg tgaagggccc cgacccttcc 240 
agcccagctt tccgcatcga ggatgccaac ctgatccccc ctgtgcctga tgacaagttc 300 
caagacctgg tggatgctgt gcggacagaa aagggtttcc tccttctggc atccctgagg 360 
cagatgaaga agacccgggg cacgctgctg gccctggagc ggaaagacca ctctggccag 420 
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gtcttcagcg tggtgtccaa tggcaaggcg ggcaccctgg acctcagcct gaccgtccaa 480 
ggaaagcagc acgtggtgtc tgtggaagaa gctctcctgg caaccggcca gtggaagagc 540 
atcaccctgt ttgtgcagga agacagggcc cagctgtaca tcgactgtga aaagatggag 600 
aatgctgagt tggacgtccc catccaaagc gtcttcacca gagacctggc cagcatcgcc 660 
agactccgca tcgcaaaggg gggcgtcaat gacaatttcc agggggtgct gcagaatgtg 720 
aggtttgtct ttggaaccac accagaagac atcctcagga acaaaggctg ctccagctct 780 
accagtgtcc tcctcaccct tgacaacaac gtggtgaatg gttccagccc tgccatccgc 840 
actaactaca ttggccacaa gacaaaggac ttgcaagcca tctgcggcat ctcctgtgat 900 
gagctgtcca gcatggtcct ggaactcagg ggcctgcgca ccattgtgac cacgctgcag 960 
gacagcatcc gcaaagtgac tgaagagaac aaagagttgg ccaatgagct gaggcggcct 1020 
cccctatgct atcacaacgg agttcagtac agaaataacg aggaatggac tgttgatagc 1080 
tgcactgagt gtcactgtca gaactcagtt accatctgca aaaaggtgtc ctgccccatc 1140 
atgccctgct ccaatgccac agttcctgat ggagaatgct gtcctcgctg ttggcccagc 1200 
gactctgcgg acgatggctg gtctccatgg tccgagtgga cctcctgttc tacgagctgt 1260 
ggcaatggaa ttcagcagcg cggccgctcc tgcgatagcc tcaacaaccg atgtgagggc 1320 
tcctcggtcc agacacggac ctgccacatt caggagtgtg acaagagatt taaacaggat 1380 
ggtggctgga gccactggtc cccgtggtca tcttgttctg tgacatgtgg tgatggtgtg 1440 
atcacaagga tccggctctg caactctccc agcccccaga tgaacgggaa accctgtgaa 1500 
ggcgaagcgc gggagaccaa agcctgcaag aaagacgcct gccccatcaa tggaggctgg 1560 
ggtccttggt caccatggga catctgttct gtcacctgtg gaggaggggt acagaaacgt 1620 
agtcgtctct gcaacaaccc cacaccccag tttggaggca aggactgcgt tggtgatgta 1680 
acagaaaacc agatctgcaa caagcaggac tgtccaattg atggatgcct gtccaatccc 1740 
tgctttgccg gcgtgaagtg tactagctac cctgatggca gctggaaatg tggtgcttgt 1800 
ccccctggtt acagtggaaa tggcatccag tgcacagatg ttgatgagtg caaagaagtg 1860 
cctgatgcct gcttcaacca caatggagag caccggtgtg agaacacgga ccccggctac 1920 
aactgcctgc cctgcccccc acgcttcacc ggctcacagc ccttcggcca gggtgtcgaa 1980 
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catgccacgg ccaacaaaca ggtgtgcaag ccccgtaacc cctgcacgga tgggacccac 2040 
gactgcaaca agaacgccaa gtgcaactac ctgggccact atagcgaccc catgtaccgc 2100 
tgcgagtgca agcctggcta cgctggcaat ggcatcatct gcggggagga cacagacctg 2160 
gatggctggc ccaatgagaa cctggtgtgc gtggccaatg cgacttacca ctgcaaaaag 2220 
gataattgcc ccaaccttcc caactcaggg caggaagact atgacaagga tggaattggt 2280 
gatgcctgtg atgatgacga tgacaatgat aaaattccag atgacaggga caactgtcca 2340 
ttccattaca acccagctca gtatgactat gacagagatg atgtgggaga ccgctgtgac 2400 
aactgtccct acaaccacaa cccagatcag gcagacacag acaacaatgg ggaaggagac 24 60 
gcctgtgctg cagacattga tggagacggt atcctcaatg aacgggacaa ctgccagtac 2520 
gtctacaatg tggaccagag agacactgat atggatgggg ttggagatca gtgtgacaat 2580 
tgccccttgg aacacaatcc ggatcagctg gactctgact cagaccgcat tggagatacc 2640 
tgtgacaaca atcaggatat tgatgaagat ggccaccaga acaatctgga caactgtccc 2700 
tatgtgccca atgccaacca ggctgaccat gacaaagatg gcaagggaga tgcctgtgac 2760 
cacgatgatg acaacgatgg cattcctgat gacaaggaca actgcagact cgtgcccaat 2820 
cccgaccaga aggactctga cggcgatggt cgaggtgatg cctgcaaaga tgattttgac 2880 
catgacagtg tgccagacat cgatgacatc tgtcctgaga atgttgacat- cagtgagacc 2940 
gatttccgcc gattccagat gattcctctg gaccccaaag ggacatccca aaatgaccct 3000 
aactgggttg tacgccatca gggtaaagaa ctcgtccaga ctgtcaactg tgatcctgga 3060 
ctcgctgtag gttatgatga gtttaatgct gtggacttca gtggcacctt cttcatcaac 3120 
accgaaaggg acgatgacta tgctggattt gtctttggct accagtccag cagccgcttt 3180 
tatgttgtga tgtggaagca agtcacccag tcctactggg acaccaaccc cacgagggct 3240 
cagggatact cgggcctttc tgtgaaagtt gtaaactcca ccacagggcc tggcgagcac 3300 
ctgcggaacg ccctgtggca cacaggaaac acccctggcc aggtgcgcac cctgtggcat 3360 
gaccctcgtc acataggctg gaaagatttc accgcctaca gatggcgtct cagccacagg 3420 
ccaaagacgg gtttcattag agtggtgatg tatgaaggga agaaaatcat ggctgactca 3480 
ggacccatct atgataaaac ctatgctggt ggtagactag ggttgtttgt cttctctcaa 3540 
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gaaatggtgt tcttctctga cctgaaatac gaatgtagag atccctaatc atcaaattgt 3600 
tgattgaaag actgatcata aaccaatgct ggtattgcac cttctggaac tatgggcttg 3660 
agaaaacccc caggatcact tctccttggc ttccttcttt tctgtgcttg catcagtgtg 3720 
gactcctaga acgtgcgacc tgcctcaaga aaatgcagtt ttcaaaaaca gactcagcat 3780 
tcagcctcca atgaataaga catcttccaa gcatataaac aattgctttg gtttcctttt 3840 
gaaaaagcat ctacttgctt cagttgggaa ggtgcccatt ccactctgcc tttgtcacag 3900 
agcagggtgc tattgtgagg ccatctctga gcagtggact caaaagcatt ttcaggcatg 3960 
tcagagaagg gaggactcac tagaattagc aaacaaaacc accctgacat cctccttcag 4020 
gaacacgggg agcagaggcc aaagcactaa ggggagggcg catacccgag acgattgtat 4080 
gaagaaaata tggaggaact gttacatgtt cggtactaag tcattttcag gggattgaaa 4140 
gactattgct ggatttcatg atgctgactg gcgttagctg attaacccat gtaaataggc 4200 
acttaaatag aagcaggaaa gggagacaaa gactggcttc tggacttcct ccctgatccc 4260 
cacccttact catcacctgc agtggccaga attagggaat cagaatcgaa accagtgtaa 4320 
ggcagtgctg gctgccattg cctggtcaca ttgaaattgg tggcttcatt ctagatgtag 4380 
cttgtgcaga tgtagcagga aaataggaaa acctaccatc tcagtgagca ccag 4434 

<210> 25 
<211> 2837 
<212> DNA 
<213> Unknown 

<220> 

<223> Description of Unknown Organism: Unknown 
<400> 25 

agagagccag tccgatgtct gcagcctccc tggccaggcc tctcctctcc tgccgcagct 60 
agtccccctc aggacagaca gagtactggc gtcggtcacc attcacttgc aaacacacca 120 
ggtcacgtga agaaacttcc tggtgacact caggctgtag ctgtgcactc ttcaaccacg 180 
aggttggttt tctcctaagt gtcacaggtg gagacaagat gctctgggca ctggccctgc 240 
tggctctggg catagggcca agagcttctg ctggtgacca cgtcaaggac acttcatttg 300 
accttttcag catcagcaac attaaccgga agaccatcgg tgccaagcag ttccgagggc 360 
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ctgaccccgg ggtgcccgcc taccgttttg 
cagatgatct caacaggatt gtcaagcttg 
cccaactgaa gcaggaccgc aagtctcggg 
cctcccagag gcagtttgag attgtgtcca 
actgggtaga aggcaatcag cataccaact 
agtggaagaa tgtgactgtg caggtggcca 
atcttatcga cagtgtcacc ctggaagaac 
gcaggatgta cgtggccaaa ggtgcatctc 
atgtccatct cgtgtttgca gattctgtgg 
acagccaggg agctgaagtc aacaccatca 
ctcacatcac cacagatctc gtggtccagg 
actcctgcga ggagttgagc aacatgatga 
accagctgag caagaacctg gagagagtgt 
ttgggggccc tctgaagaca agaaacatgt 
cagaaaatga aacctgggtt gtggatagtt 
cagtctgcca tcagatcacc tgctcacctg 
gcgagtgctg tccatcctgt tcacactctg 
cagagtggac cgagtgttct gtcacctgtg 
gtgatgtcac cagcaacacc tgcctgggcc 
gcaaatgtga tacgagaatc cgtcagaatg 
catgctccgt gacttgtgga gttggcaatg 
tgccccagat gggtggcaag aactgcaagg 
gtgatccgtg cccaattgat ggccgctgga 
ttacctgtgc tggagggatc cgtgagcgct 
atggagggaa ggactgtgtc ggggatgtga 
gccctattga t^ggtgctta tccaacccgt 



-77- 

tacggtttga ctacatcccc ccagtgaaca 420 
caaggagaaa ggagggcttc ttcctcacag 480 
gaacgctcct ggtgttggaa ggccccggca 540 
atggcccagg ggacactttg gacctcaact 600 
tcctggagga tgtgggcctg gctgactccc 660 
gtgacaccta tagcctgtat gtgggctgcg 720 
cattctatga gcagctagaa gtagacagga 780 
gagagagtca cttcaggggc ttgctgcaga 840 
aagatatctt aagcaagaaa agctgtcaac 900 
gtgaacatac agagactctc catctgagcc 960 
gtgtggagaa ggcacaggag gtgtgtacgc 1020 
atgagctctc tggactgcac gtcatggtga 1080 
ctagtgataa ccagttcctt ttggagctca 1140 
cagcctgtgt gcaggagggc cgaatctttg 1200 
gtaccacatg cacctgcaag aaatttaaaa 1260 
caacttgtgc caacccatct tttgtggaag 1320 
cagacagtga tgagggctgg tctccgtggg 1380 
gctctgggac ccagcagaga ggccggtctt 1440 
cctccattca gacaaggaca tgcagcctgg 1500 
gaggctggag tcactggtca ccctggtctt 1560 
tcacccgcat acgtctctgc aactcaccag 1620 
gcagcggccg ggaaaccaaa ccctgtcagc 1680 
gcccctggtc cccttggtca gcctgcacag 1740 
cacgtgtttg caacagccct gagccccagt 1800 
cagaacacca aatgtgcaac aagagaagct 1860 
gttttcctgg agccaagtgc aacagcttcc 1920 
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aacaaagctc 


2040 


r* r* n r* 1" n 1" n 1" 


raacaccaac ccooocttcc 


actgcctgcc 


ttgtccacca 


cgctacaagg 


2100 


y yaO^UQauv^ 


cttcaatcrtt aacctqoaqo 


atgetaggae 


agaaaaacaa 


qtqtqtqaac 


2160 




ai"otaaaaac aaaactcaca 


gctgccacaa 


qaatqcaqaq 


tgcatctacc 


2220 


tyyyuuavn' l» u 


taatoacccc atotacaaqt 


qtqaqtqcca 


gattggctac 


qcaqqtqatq 


2280 


nnpi" pat - rr 


f*aciciciaciciac tcaoscctoo 


atqqctqqcc 


caacaacaac 


ctoqtqtqta 


2340 




rarrtaccac tacatcaaaa 


acaactgccc 


caaactgcca 


aattceggge 


2400 


acraanaattt 


taataaaoat' qqaatcqqaq 


atgcttgtga 


cqaqqacqat 


gacaatgacg 


2460 


at* ci t* era n c a a 


tgagaaggac aattgecage 


ttctcttcaa 


tccccgtcaa 


ttagactatg 


2520 


araacraatoa 


aattaaaaac cactqtqaca 


actgccccta 


tgtgcacaac 


ccagcacaga 


2580 


tcgacacaga 


caacaatggc gagggggatg 


cctgctctgt 


ggacattgac 


ggagacgatg 


2640 


ttttcaatga 


gcgagacaat tgtccatatg 


tctacaacac 


tgaccagaga 


gataeggatg 


2700 


gtgatggcgt 


gggtgaccac tgtgacaatt 


gtcctctgat 


gcacaaccca 


gatcagatcg 


2760 


atcaggacaa 


tgatctcgtt ggagaccagt 


gtgacaacaa 


tgaggacata 


gatgatgacg 


2820 


gccaccagaa 


caaccaa 








2837 



<210> 26 
<211> 4108 
<212> DNA 
<213> Unknown 

<220> 

<223> Description of Unknown Organism: Unknown 
<400> 26 

agagagecag tccgatgtct gcagcctccc tggccaggcc tctcctctcc tgccgcagct 60 
agtccccctc aggacagaca gagtactggc gtcggtcacc attcacttgc aaacacacca 120 
ggtcacgtga agaaacttcc tggtgacact caggctgtag ctgtgcactc ttcaaccacg 180 
aggttggttt tctcctaagt gtcacaggtg gagacaagat gctctgggca ctggccctgc 240 
tggctctggg catagggeca agagcttctg ctggtgacca egtcaaggae acttcatttg 300 
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accttttcag catcagcaac attaaccgga 
ctgaccccgg ggtgcccgcc taccgttttg 
cagatgatct caacaggatt gtcaagcttg 
cccaactgaa gcaggaccgc aagtctcggg 
cctcccagag gcagtttgag attgtgtcca 
actgggtaga aggcaatcag cataccaact 
agtggaagaa tgtgactgtg caggtggcca 
atcttatcga cagtgtcacc ctggaagaac 
gcaggatgta cgtggccaaa ggtgcatctc 
atgtccatct cgt;gtttgca gattctgtgg 
acagccaggg agctgaagtc aacaccatca 
ctcacatcac cacagatctc gtggtccagg 
actcctgcga ggagttgagc aacatgatga 
accagctgag caagaacctg gagagagtgt 
ttgggggccc tctgaagaca agaaacatgt 
cagaaaatga aacctgggtt gtggatagtt 
cagtctgcca tcagatcacc tgctcacctg 
gcgagtgctg tccatcctgt tcacactctg 
cagagtggac cgagtgttct gtcacctgtg 
gtgatgtcac cagcaacacc tgcctgggcc 
gcaaatgtga tacgagaatc cgtcagaatg 
catgctccgt gacttgtgga gttggcaatg 
tgccccagat gggtggcaag aactgcaagg 
gtgatccgtg cccaattgat ggccgctgga 
ttacctgtgc tggagggatc cgtgagcgct 
atggagggaa ggactgtgtc ggggatgtga 



-79- 

agaccatcgg tgccaagcag ttccgagggc 360 
tacggtttga ctacatcccc ccagtgaaca 420 
caaggagaaa ggagggcttc ttcctcacag 480 
gaacgctcct ggtgttggaa ggccccggca 540 
atggcccagg ggacactttg gacctcaact 600 
tcctggagga tgtgggcctg gctgactccc 660 
gtgacaccta tagcctgtat gtgggctgcg 720 
cattctatga gcagctagaa gtagacagga 780 
gagagagtca cttcaggggc ttgctgcaga 840 
aagatatctt aagcaagaaa agctgtcaac 900 
gtgaacatac agagactctc catctgagcc 960 
gtgtggagaa ggcacaggag gtgtgtacgc 1020 
atgagctctc tggactgcac gtcatggtga 1080 
ctagtgataa ccagttcctt ttggagctca 1140 
cagcctgtgt gcaggagggc cgaatctttg 1200 
gtaccacatg cacctgcaag aaatttaaaa 1260 
caacttgtgc caacccatct tttgtggaag 1320 
cagacagtga tgagggctgg tctccgtggg 1380 
gctctgggac ccagcagaga ggccggtctt 1440 
cctccattca gacaaggaca tgcagcctgg 1500 
gaggctggag tcactggtca ccctggtctt 1560 
tcacccgcat acgtctctgc aactcaccag 1620 
gcagcggccg ggaaaccaaa ccctgtcagc 1680 
gcccctggtc cccttggtca gcctgcacag 1740 
cacgtgtttg caacagccct gagccccagt 1800 
cagaacacca aatgtgcaac aagagaagct 1860 
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gccctattga tgggtgctta tccaacccgt 
ctgatgggtc ctggtcctgt ggctcctgcc 
gtgaggacct ggatgagtgt gctgtggtca 
cccgctgtgt caacaccaac ccgggcttcc 
ggaaccaacc cttcggtgtt ggcctggagg 
cagagaatcc atgtaaggac aagactcaca 
tgggccactt tagtgacccc atgtacaagt 
ggctcatctg cggggaggac tcagacctgg 
ctactaatgc cacctaccac tgcatcaagg 
aggaggattt tgataaggat ggaatcggag 
gtgtgagcga tgagaaggac aattgccagc 
acaaggatga ggttggagac cgctgtgaca 
tcgacacaga caacaatggc gagggggatg 
ttttcaatga gcgagacaat tgtccatatg 
gtgatggcgt gggtgaccac tgtgacaatt 
atcaggacaa tgatctcgtt ggagaccagt 
gccaccagaa caaccaagac aactgcccat 
acaacgacgg caagggcgat gcctgcgact 
acagggacaa ctgtcggctt gtgttcaacc 
gaggtgacat ttgtaaagat gactttgaca 
gccctgagaa caatgccatc actgagacag 
atcccaaggg gaccacacaa attgatccca 
tggtgcagac agcaaactca gaecctggca 
tggacttcag tggcactttc tatgtcaaca 
tctttggcta tcagtcaagc agccgcttct 
cctactggga agacaagccc agtcgggctt 



gttttcctgg agccaagtgc aacagcttcc 1920 
cagtgggctt tctgggcaat ggtacccact 1980 
cagatatttg cttctcaact aacaaagctc 2040 
actgcctgcc ttgtccacca cgctacaagg 2100 
atgctaggac agaaaaacaa gtgtgtgagc 2160 
gctgccacaa gaatgcagag tgcatctacc 2220 
gtgagtgcca gattggctac gcaggtgatg 2280 
atggctggcc caacaacaac ctggtgtgtg 2340 
acaactgccc caaactgcca aattccgggc 2400 
atgcttgtga cgaggacgat gacaatgacg 2460 
ttctcttcaa tccccgtcaa ttagactatg 2520 
actgccccta tgtgcacaac ccagcacaga 2580 
cctgctctgt ggacattgac ggagacgatg 2640 
tctacaacac tgaccagaga gatacggatg 2700 
gtcctctgat gcacaaccca gatcagatcg 2760 
gtgacaacaa tgaggacata gatgatgacg 2820 
acatctccaa ctccaaccag gctgaccatg 2880 
ctgatgatga caatgatggt gttccagatg 2940 
cagaccagga agactcggac ggtgacggcc 3000 
atgataatgt cccagatatt gatgatgtgt 3060 
acttcagaaa cttccagatg gtccctctgg 3120 
actgggtaat tcgtcaccaa ggcaaagagc 3180 
tcgctgtagg tttcgacgag tttgggtctg 3240 
ctgaccggga tgatgactac gctggctttg 3300 
atgtggtgat gtggaagcag gtgacccaga 3360 
acggctactc tggtgtgtca ctcaaagtgg 3420 
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taaactccac gactggtact ggcgagcacc tgaggaatgc cctgtggcac acgggaaaca 34 80 
cagaaggcca ggtccggact ctatggcatg accccaaaaa cattggctgg aaagactaca 3540 
ctgcctacag gtggcacctg attcacaggc ctaagacagg ctacatgaga gtcttagtgc 3600 
atgaaggaaa gcaagtcatg gctgactcag gaccaattta tgaccaaacc tacgctggtg 3660 
gacggctggg cctgtttgtc ttctcccaag agatggtcta tttctcggac ctcaagtatg 3720 
agtgcagaga tgcctagaga gcagggctcc agctccagca atgtgctgca aacacccctt 3780 
cttagacaca tcagtccatc ttggcacttg tggcttttct gtcatttggc atttcctgtt 3840 
tcttgacctt aactgagtgg atctacacct ccttcatcag caccaagtcc aagtgtcttc 3900 
aaaggagaaa catcaattgc actccaagag cttccagcct gctgctggaa aacatttgga 3960 
tgagatatga ggctcaccgt ggagcgaaga ccgagcattc cgctgtgttg ccttttcttg 4 020 
tttgtttaaa aagaatgacg tttacatgta aatgtaatta cttgcagtat ttatgtgtat 4080 
atggagtcga agggagcttt agagcaca 4108 

<210> 27 
<211> 820 
<212> DNA 
<213> Unknown 

<220> 

<223> Description of Unknown Organism: Unknown 
<400> 27 

tcgaccagag gaggggaggc cagttcctct cccaagggtg ccacacaccc ctccctgttc 60 
atcaccagac aggcccttcc ttcttagcca tatgctaacc ttctcctccc tgggaaattt 120 
cctctgcagg agccaaagca gatgggagct ggagttgctg gagctcctgg tctgtatgca 180 
gagcaggcat ccaggaaagg agaagagagt gtgacaatcc agcacctcag aatggagggg 240 
cctcgtgttc agggcggaaa gtacagacgc aggcttgctg agggcctctg gacacaggct 300 
ggaccagatg ctgtggatgt cgacccctgc actgactatt ggataaagac ttctttcaac 360 
taagagaaga tgcaaatcag cacacttttt tctttgttct gccagcttcc aggcctaaga 420 
ctaggttttg ctgtctacag ccaactattc tattagttac aaaactcaat cattttattc 4 80 
agcaactgga tgttgactgt taactagaag ctctgtccta cttacagcac tttggatcat 540 
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caaaaaaata aagtaaaata gaaaactgag aaaactcaat ccatgaccag ggagaactta 600 



taggatgaaa attctcttag ttctttgata acaatttgtt cactcataga aacattatta 720 
at.tggtaggg taagcagaca ctctgaaaca atgagaaaaa tactaaaaat tgacttgagt 780 



<210> 28 
<211> 2397 
<212> DNA 
<213> Unknown 

<220> 

<223> Description of Unknown Organism: Unknown 
<400> 28 

tttttttttt catcctactt tgttttattg ggcgttgatt gttacaggtc ccagcctgta 60 
gacatctttt actccaattt cctgaataga tagctttatt ccttcaaggt aatatagtgc 120 
ggtggcttct ggctgagatg tttgctgttg ttttcttcat cttgtctttg atgacttgtc 180 
agcctggggt aactgcacag gagaaggtga accagagagt ^aagacgggca gctacacccg 240 
cagcagttac ctgccagctg agcaactggt cagagtggac agattgcttt ccgtgccagg 300 
acaaaaagta ccgacaccgg agcctcttgc agccaaacaa gtttggggga accatctgca 360 
gtggtgacat ctgggatcaa gccagctgct ccagttctac aacttgtgta aggcaagcac 420 
agtgtggaca ggatttccag tgtaaggaga caggtcgctg cctgaaacgc caccttgtgt 4 80 
gtaatggaga ccaggactgc cttgatggct ctgatgagga cgactgtgaa gatgtcaggg 540 
ccattgacga agactgcagc cagtatgaac caattccagg atcacagaag gcagccttgg 600 
ggtacaatat cctgacccag gaagatgctc agagtgtgta cgatgccagt tattatgggg 660 
gccagtgtga gacggtatac aatggggaat ggagggagct tcgatatgac tccacctgtg 720 
aacgtctcta ctatggagat gatgagaaat actttcggaa accctacaac tttctgaagt 780 
accactttga agccctggca gatactggaa tctcctcaga gttttatgat aatgcaaatg 840 
accttctttc caaagttaaa aaagacaagt ctgactcatt tggagtgacc atcggcatag 900 
gcccagccgg cagcccttta ttggtgggtg taggtgtatc ccactcacaa gacacttcat 960 



caggatgtta gagacaaaac aagcagacac ctgaaacaat caacgcccaa taaaacaaag 



660 



tatttcaaat tgcctcattg acctgttata tcataactct 



820 
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tcttgaacga attaaacaag tataatgaga 
aggtgcagac tgcacatttt aagatgagga 
tgcagtcatt aatggagctt ccagatcagt 
atgactatgg cacccattac atcacatctg 
tggtgattga caaagcaaaa atggaatccc 
gttttggagg ctccttgggc attcaatatg 
caggagacca ttgtaaaaaa tttggaggtg 
ctgtggaaga cattatttct cgggtgcgag 
cacagaacag gagcaccatt acataccgtt 
ttgttatcga ttttgagatg cagcctatcc 
ctctggaggc caagcgccag aacctgcgcc 
atgcctgccg atgtgggcct tgcttcaaca 
gcaggtgcca gtgccgcctg ggtagcttgg 
gagccaaagc agatgggagc tggagttgct 
tccaggaaag gagaagagag tgtgacaatc 
cagggcggaa agtacagacg caggcttgct 
gctgtggatg tcgacccctg cactgactat 
atgcaaatca gcacactttt ttctttgttc 
gctgtctaca gccaactatt ctattagtta 
atgttgactg ttaactagaa gctctgtcct 
aaagtaaaat agaaaactga gaaaactcaa 
agagacaaaa caagcagaca cctgaaacaa 
aattctctta gttctttgat aacaatttgt 
gtaagcagac actctgaaac aatgagaaaa 



-83- 

agaaattcat tttcacaaga atcttcacaa 1020 
aggatgacat tatgctggat gaaggaatgc 1080 
acaattatgg catgtatgcc aagttcatca 1140 
gatccatggg tggcatttat gaatatatcc 1200 
ttggtattac cagcagagat atcacgacat 1260 
aagacaaaat aaatgttggt ggaggtttat 1320 
gcaaaactga aagggccagg aaggccatgg 1380 
gtggcagttc tggctggagc ggtggcttgg 1440 
cctgggggag gtcattaaag tataatcctg 1500 
acgaggtgct gcggcacaca agcctggggc 1560 
gcgccttgga ccagtatctg atggaattca 1620 
atggggtgcc catcctcgag ggcaccagct 1680 
gtgctgcctg tgagcaaaca cagacagaag 1740 
ggagctcctg gtctgtatgc agagcaggca 1800 
cagcacctca gaatggaggg gcctcgtgtc 1860 
gagggcctct ggacacaggc tggaccagat 1920 
tggataaaga cttctttcaa ctaagagaag 1980 
tgccagcttc caggcctaag actaggtttt 2040 
caaaactcaa tcattttatt cagcaactgg 2100 
acttacagca ctttggatca tcaaaaaaat 2160 
tccatgacca gggagaactt acaggatgtt 2220 
tcaacgccca ataaaacaaa gtaggatgaa 2280 
tcactcatag aaacattatt aattggtagg 2340 
atactaaaaa ttgacttgag ttatttc 2397 



<210> 29 
<211> 4100 
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<212> DNA 
<213> Unknown 

. <220> 

<223> Description of Unknown Organism .'Unknown 
<400> 29 

ggatcccccc gctccgctac catcttcatc gacctcaccc aggacgacga ctgagctccc 60 

tcttcctcgc cgcggactgg ggcgaccctg ttgctgctgc ggccgccgcc gctcctgccc 120 

ccacttcggc tcccgctcct gctcctgctc dcggccccac tcctgttcct gttcctgttc 180 

ctgttcctgt tcccggtcct gctccggctc ccggccccgc acccacctcc gctcctgctg 24 0 

cgggtctcca ggcccagaca aaataaaaaa agatatattt tttcagtccg tctctcccgc 300 

ccggtgtctt ctatggctga gggagtctgg ctctcggggc tctcgggtcg gctgggcggc 360 

tcggctggtt ggctggctgg cgagatggac cgctccggcg cgcagcgtcc gcggctgctg 420 

tgatgggtgg gcggagcgcg gaccggggat tatatacacg atgtgcatcc ataattgatg 480 

ttgtttgaga aaaacaaagt cataaagtgg cactcagaca gcactttggc ctggcgcccg 540 

gccaccatct gagtgcccaa ccgggcccgg cggttacatc acccccacat ggaccatcac 600 

ggcccattag caccaattgg ccagagtgtc gggagccacc gctaattgca gtaacgcgcg 660 

gctgccagac tgcaatttac cgcgcgatac tgcagtttac tgcagccgcg gtaaactgca 720 

gtacgcggcg gccgcaggaa atctactgta gtatttggcg gcggcgcgcg gtactgcaac 780 

tgtagtaaac tgtagctgca gtagagttac tgcagcgcca tcgggccggt gtggccgcca 840 

gggtaactgc acccgcagta aatttactgc agccggactt tgtgcgctgt ggagaccgcg 900 

ccgaactggg acccccccga ctcccccccg actccccccc gactcccccc cgactccccc 960 

ccgactcccc cccgactccc ccccgactcc cccccgactc ccccgggacg cgtccgcgcc 1020 

tcgatgcgcc ccatcgcgcc ccgttccgct tcgccacgct ccagttgccc cgcccccggc 1080 

acgtggcacg tatttccccc ccgtaaatca agagggatta tgcggatgtc tagtttatgt 1140 

ctcaatttcc tctttccgga gataaaagcc gggacccccg cgccgaaaaa ggatacacca 1200 

gccgcgatgt cgccgctcgt ggcggtgctg gtgttttttt cggcggcgct gggggttcct 1260 

ggccccggcg tcgcgggaaa cccccgtggg ctcgatgcca tcttcgaggc cccggtcacg 1320 

cccgcgcccc ccactcgcca tcctcggcgc gaggagctgg agtgggacga tgaggatcac 1380 
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ccgctgctgg acctcgagcc gcccgtggga tcacgctgcc atccctacat cgcgtactcg 1440 
ctgccgccgg acatgaacgc cgtcacgagc gtggtcgtga agccctactg ctcgccgccg 1500 
gaggtcatcc tgtgggcgtc tggcaccgcc tacctggtca acccctttgt cgccatccag 1560 
gccctggccg tcggagagcc cttaaatgag gcggccctca aggagctcgg agaggtggcc 1620 
gtgcacaagg actccctgcc gccgctgcgc tataatggag ggccccccgc cgagtaagag 1680 
accctgcggc ctgccgcccg gggtgcgcct cgtcgtgcct gccgccgccg ccgcttctgc 1740 
ctctaacgcc gccaccgccg ctgcagcagc agcccccgcc ggggccgggg ccggggcctc 1800 
gaagccggcc cgaccccccg ccgccgcccg gcccgcgaag ggcacgcccg cggcgtcggc 1860 
ggcaacaaca gccacggggg ccgacgcctc cgccccggcc cccgaccccg gggcgcccac 1920 
gtgggacgcc ttcgccgccg agttcgacgt ggccccctcg tggcgcgcgc tgctggagcc 1980 
cgagatcgcc aagccgtacg cgcgcctgct gctggccgag taccgcggcc gctgcctgac 2040 
cgaggaggtg ctgcccgcgc gcgaggacgt gttcgcctgg acgcgcctca cggcgcccga 2100 
ggacgtcaag gtggtcatca tcggccagga cccgtaccac gggccgggcc aggcccacgg 2160 
gctggccttc agcgtccggc gcggggtgcc gatccccccg agcctggcca acatcttcgc 2220 
ggcggtccgg gcgacgtacc cgacgctgcc cgcgcccgcc cacggctgcc tggaggcctg 2280 
ggcgcgccgc ggggtgctgc tgctgaacac gacgctgacc gtgcggcgcg gggtccccgg 2340 
ctcccacgcc ccgctcggct gggcgcggct cgtgcgcgcc gtcgtccagc ggctctgcga 2400 
gacccgcccc aagctggtgt tcatgctctg gggcgcccac gctcaaaagg cctgcgcgcc 24 60 
ggacccgcgc cgccacaagg tgctcacctt cagccatccg tcgccgctgg cccgcacgcc 2520 
cttcaggacc tgcccgcact ttggagaggc gaacgcgtac ctcgtccaga cgggccgggc 2580 
ccccgtcgac tggagcgtgg actgagtcgg gcgtgcgcgc acaccgccgg cggaggacga 2640 
ggagggggga ggggggtggg atggacggag gagagcggat gatggagccc gcgctcgccg 2700 
gcgccccggc cagcgcgctg ccggtcctgg cggtgctgcg cgagtgggga tgggccgtgg 2760 
aggaggtcga gccctccggg ccgtgcccgg aggacgcgga cgcgccccgg gagagcgcac 2820 
cccctccccg ggagggggtg cgcgggagcg aagacggaga ggggggcgtg gaagacggcg 2880 
aggaggggaa ggcgacggag aaggaggaga cggaagacga ggaagacggg ggggacgaag 2940 
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ggacgacgac ggcggcggcg ggcccgcgcc gggcgcagca cgtggagttt gacacgctgt 3000 

ttatggtcgc gtccgtggac gagctcgggc gccggcggct gacggacacg atccgccggg 3060 

acctggccgc ggccctggcc ggcctccccg tcgcctgcac caagacgtcc gcgtttgcgc 3120 

gcggcgcgcg cggcccgcgc ggcgcccccg ggcgcggcca taaaagcctg cagatgttta 3180 

tcctgtgccg cagagcccac gcggcgcgcg tacgcgatca gctccggtcc gcggtgcgcg 3240 

cccgacgccc acgcgagccc cgcgcgcgcc cgacgagcgg acgggcgcgg ccggccgcgc 3300 

cggtgttcat ccacgagttc atcacccccg agccggtgcg gctgcaccgg gacaacgtgt 3360 

ttgcggcgcc atgagcacct tcggacgcgc gtccgtggcc acggtcgatg actaccaccg 3420 

gttcctgcag gccaacgaga cggccgcccg gcgcctggcc gcggcctccc gccgcgtctc 3480 

caccggcggg ggcgagacgc gggccccgcg gtcctcgcgc ggcccccacg acgatgaggc 3540 

gcccctgcgc gccggcggcc tgggcaccgc ccgcgggcgc tcgcgccagc gcggcgcgac 3600 

cgagccggac cccgtctacg ccaccgtcgt ccagcctacc caccaccacc accagcagca 3660 

ccaccaccgc tctcagcatc cgcagcagca gcaacaacag cagcgggccc cacgccgccg 3720 

cggcagcgtg cacgcctcgg cgacggccgc ggacggaccc gagtcgtgcg cggccgcacc 3780 

cccgcgccgc cgcggcagcg tgcacgcctc ggcgacggcc gccccggcgg tccagctgcc 3840 

ccggccccgg caacggagca tcaacgcctc gacgacggcc gccccgacgc cccagctgcc 3900 

gagaccccgc cagcgcagcg tcaacgcctc ggcccgcgcc gccgtcccct cgacggccac 3960 

cctcccgcgc ccccggaccc cgtcccgggg ccggcgcgcg ccccccgcct catgctgtta 4020 

tcgcgatcaa taaagggcga gcgcccacgg accagacaaa agacacaacc ggttcggtct 4080 
ctctgtccgc gcacgcgcgg 4100 

<210> 30 
<211> 38734 
<212> DNA 
<213> Unknown 

<220> 

<223> Description of Unknown Organism: Unknown 
<400> 30 

gatcctcgtg accgggtaca ccgacgcctc ctggacgccg ctgttcgcca tcgcgggcgg 60 
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ggtcgtcacc gacatcgggt cgatgctctc 
cgtcccgtcg gtggtgaaca ccaaggacgc 
cgtggtggac ggcgacgcgg gcacggtcga 
gggcccggcc ggggccgccg ggaccccggc 
gccgcaacac cggaccacga ccgcccccgc 
gacccgatga tccccaacca gtggtatccc 
aaaccgctcg gtgtgcgccg catgggccag 
aacctcgtct gccagggcgc ccgctgcccg 
atgaagggca acaccatcga atgcccgtac 
tgccgggtga tcccggcgat gggctccgag 
acctacccgg tccgggagca gttcggcctg 
acggccgacc tgccgccggt ggcggccccg 
gccaccaagc gctggacccg cccggtgcac 
ttctaccacg tgacctacgt gcaccgggac 
ctgtacggca ccccgagcaa gttcggcctc 
cggatcacca accaccgggt ggagacggag 
gaccactgcc aggaggacga ccccaccaac 
ccgtgcatgg tgcacgtgca gaccgagcag 
gacgaccaga acaccgagca catcctgcgc 
ctgaggttcg aaccgctgcg ccgtctgctg 
gtgcaggacc cccaggacgt ccgcatcatg 
ggcgtgaaca agttcatccc cgtcgacgag 
aagctgatcg cggacgcctc ggccgcgccc 
ccggaagcgg cggggcgggg cggatcagcg 
gccggcggac gacgcggcac caagcccaag 
agacggggga cggacaagag agagcgagag 
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gcacagttcc atcgtggccc gcgagttcca 120 
cacccagcgc atcaacaccg gcgacctgat 180 
ggtcgtcgag agcgcggaca ccgacccgca 240 
cggagccacc accgactgaa gccggccacc 300 
gaggggcgga ccacacccca gacgggagac 360 
atcgtcgagg cgcaggaggt gggcaacgac 420 
gacctcgtgc tctggcgcga catcgacggc 480 
cacaagggcg ccaacctcgg cgacggccgc 540 
cacggcttcc gctacggagc cgacggtgcc 600 
gcccgcatcc ccggctcgct gcgggtaccc 660 
gtgtggatgt ggtggggcga cgagcgcccg 720 
gccgaggtga cggacaaccg gaagctgtac 780 
tacacccgtt acatcgagag cctgctcgag 840 
cactggttca actacatcga ctacctgctc 900 
gacggccgcg agcggtacct ggccgccacc 960 
gcggaggggc agaccatccg ctactccttc 1020 
accacccact acgtcatcac gttcaccttc 1080 
ttcgagacca cctcctggct ggtgcccatc 1140 
tggtacgagt acgaacaggt caagcccgtc 1200 
ccctgggcgt ccctctacat ggagaagtgg 1260 
gaacaccagg aacccaagat cagcgccggc 1320 
atgaacgcca agtacatctc gatgcgcgcc 1380 
tcgtcaccgg cgcgggcggc ggagcccgag 1440 
gcccgtgcca cgggcaacgg caggggagcg 1500 
gaggacgccg ccgcgcgccc gtagacccga 1560 
tgagagatgt acggcggata cgacgcgtcg 1620 
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accggcccca aggccctggt gacggccttc 
ttcctgttcg gcggcgcgga caccgtggcc 
gtgaccctgc gccgggtcct gctggcgacc 
gccacgaact tcgtgatgct ccagcgcaag 
atctgggtcc tggtgatcca cggcacgatg 
gtgagcgcgt tcacctggct gggcgtcgtg 
gggtcggagt accagcgcaa actctggaag 
accgaaggcc tgttcaagca ctcgatgcac 
tccgggttcg cgctggtcac gggcaccccg 
tgcatgttcg tcttcctgaa catccccatg 
gaggccttcg acgagtacgc gtcccggacg 
cgcccgtcac gcgcgtacgg cggcctcccc 
accacagatc ccacagatcc ccacagatcc 
agatccacag atcccctcca cagatccgag 
tcggagaggc cgaggcgtcc ctgaccgccg 
ccgaggcgtt cgcccggctg acctccgacg 
cggagctgac cgacgactac ctgctccccg 
gggtcgacct ggagggactg gagaacgtgc 
accactccgg caccctgccg ctcgacgccc 
acagcacgca ccgcaggctc cggctgctcg 
tccgtgacct cgcccgcaag gccggccacg 
tgctcggctc cggcgaactg gtcggcgtga 
ccttcgagga gcgctaccgg ctgcagcgct 
tgcggtcgcg gcgccccatg gtgccgtgct 
tgatcggctc ggcccccacc ctggcccgga 
cgaccttccc gctgctgggc gcgctgggcc 
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aacaccgtcg ccgtggccgg cgccgtgtgg 1680 
gactggttcg gcaccgactt cgacgaggcg 1740 
ctgtcggtgc tctacctgct gcgcttcatc 1800 
atggagtggt cggagtcggc caccatcggg 1860 
gcgtacttcg gcggcaccaa cgacgccggc 1920 
ctgtacctcc tcgggtccta cctgaacacg 1980 
aagcgcccgg agaacaaggg caagctctac 204 0 
atcaactact tcggtgacgc cgtgctcttc 2100 
tgggccttcg ccatccccct gatcatggtc 2160 
ctcgacaagt acctcgccga gcgatacggc 2220 
gcgaagttcg tcccctacgt gtactgaccc 2280 
gggcgagggg ggccgccgta ccgggtggca 234 0 
ccacagagcc cctccacaga ccccctccag 2400 
acgaggcacg tatgaccgga gacattccct 2460 
aggtgctgcg cgaggtcctg gccggcggcg 2520 
agggcgccgt cgacgacttc ggcttcgacc 2580 
ccctgcgcct gctgtacgag aagtacttcc 264 0 
cggccgaggg gggcgcactc ctggtcgcca 2700 
tgatgctcca ggtggcgctg cacgaccatc 2760 
ccgccgacct tgccttcgac ctccccgtcg 2820 
tacgcgcctg ccccgagaac gcgctgcggt 2880 
tgccggaggg ctacaagggg ctcggcaagc 2940 
tcggccgggg aggcttcgcg gcggtggcac 3000 
cgatcgtcgg cgccgaggag atctacccga 3060 
tgctgaagct gccgtacttc ccgatcaccc 3120 
tgatcccgat gccgaccaag tggaccatcc 3180 
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gcttcggtgc cccgatccac acggacggct 
tggtcgagaa gctcgccggc gaggtgaagg 
tggagggccg cggctccccg ttcgtctgag 
cggctcccgg ttcgcccgag gaccgtccct 
cgacgatccc cggcggcaga tgctgcgaac 
gagcgtgacg ccgccgccga ccagctccgc 
acgacggacg gtgatggtca tatagccatg 
gcaccgcccc gcgaattcac ccgtagagca 
gggtggtcgc gcgttcgaac gcttacgatc 
caagcggatc ctcatcgggc gcgcactgcg 
gaagcgcatc gcgcttcccg tgttcgcgtc 
cggcgaggtg ctgctcgtcc tgtccatcgc 
gatcgcggtc gcggtcgtgg tcctgatgtt 
gcacgcctac ccgagcggcg gcggcgacta 
ggccggtctg accgtcgcca gcgccctgct 
catctcctcc ggcatcgaga acctgggctc 
cctgtgcgcg gtcgccgtga tcctgctgct 
gtcgggcacc ctgttcgcga ttccgacgta 
cgtgtggggg gcgttccgcg gactggtcct 
ctacgagatc aagccggagc acggcggcct 
gcgcgccttc tcctccggct gtgccgcgct 
cccggccttc cgcaagccca agtccaagaa 
gctggccgtc accatgttct gcggcatcat 
gtcggagaac ccggccaccg acctcttcca 
ccagcacccg gtgatctcgc aggtcgccga 
catcgtgctg gccgcagcca ccgcgctggt 
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tccccgagga cgccgcggag gacccgctgg 3240 
acaccatcca gcacacgctc aacgagatgc 3300 
ggccgcggct cccggttcgc ccgagggcgg 3360 
ctcgtccggg gccccgcctc agccccccgc 3420 
gctggcgaag gccagaacgg cgaggccgac 3480 
ggacagatgc atgggatctc cctcaggggg 3540 
cgaaccccgc cgtccgcccg atccgcagcc 3600 
gaccggtgcg gccgaggagg ggtggcgatt 3660 
ctctgttgtg tccaaactga ccgacgtgcc 3720 
cagcgaccgg ctgggtgaaa cgctcctgcc 3780 
cgacccgctg tcctccgtgg cgtacgcgcc 3840 
gggcgtgtcg gcctaccact tcagcccgtg 3900 
caccgtggtc gcctcctacc ggcagaacgt 3960 
cgaggtggcc accaccaacc tcgggcccaa 4020 
ggtcgactac gtcctgaccg tcgcggtctc 4080 
cgcgatcccc ttcgtcgtcg agcacaaggt 4140 
cacgctgatg aacctgcgcg gggtcaggga 4200 
cgtcttcgtc gcgggcgtct tcatcatgat 4260 
ggacgacacc atgcgtgccc cgaccgcgga 4320 
ggccggcttc gcgctgatct tcctcctcct 4380 
caccggtgtc gaggcgatct ccaacggcgt 4440 
cgcggggaac accctcgcga tgatgggtct 4500 
cgcgctggcc gccgcgaccg acgtgcggat 4560 
caacggcgtc gcggtcggcg cggactacgt 4620 
ggcggtcttc ggcgagggca gcttcctgtt 4680 
cctcttcctc gccgccaaca ccgcgtacaa 4740 
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cggcttcccg ctgctcggct cgatcctcgc 
cacccgcggc gaccgcctgg ccttctccaa 
gctcctggtc gtcgtctacg gcgccgactc 
cgtcttcgtg tccttcacgc tcagccagat 
ggccggcgag cgggaccagt ccaagcgacg 
cttcggcgcc ttcttcaccg gcctcgtcct 
cggcgcctgg gtcgcgctgc tcggcatgtg 
caagcactac gaccgggtcg ccgaggagat 
gagcgacgac atggtgcgcc cctcacgcgt 
ccgccccacg ctccgcgccc tcgcctacgc 
gctcagcgtc aacgtcgacc cggccgagac 
cggcatcgcc gtaccgctga aggtcctgga 
catcgagtac gtcaagagcc tgcgcaagga 
ccccgagtac gtggtcggcc actggtacga 
cctcaagggc cggctgctgt tcacgccggg 
ggagtcctcc gaggccgcca ggcgccgggc 
tgcggtgcgg cgcggaccgg cccaccacca 
ctcgtccacg tagactggac ggctgttgtc 
ccgccatgca ggcagaaccg aagaagtcgc 
tctcggagcc ggtctcgctg gtgggcgagg 
acggcggcca ctgcatcgcc cgcacgtccg 
tgcccggcga gcgggtcgtg gcccgggtga 
gggcggacgc ggtcgagatc ctggacccct 
tcgccggccc cggccgctgc ggcggctgcg 
gacgcctgaa gggcgaggtg gtcgccgagc 
aggaggccgg ctgggacggc acggtgatgc 



ccaggaccgc tacctgccgc gccagctgca 4 800 
cggcatcgtg ctcctcgccg gagccgccat 4860 
gacccggctg atccagctct acatcgtcgg 4 920 
cggcatggtc cgccactgga accgcaacct 4980 
ccacatgatg cgctcccgcg cgatcaacgc 5040 
ggtggtggtc ctggcgacca agttcacgca 5100 
catcttcttc gcgaccatga cggcgatccg 5160 
cgcggccccg gaggaccccg aggaggcaca 5220 
tcactcggtg gtcctgatct ccaagatcca 5280 
caagctgatg cgctccgaca gcctggaggc 5340 
gaaggcgctg cgcgaggagt gggagcgccg 5400 
ctcgccgtac cgcgagatca cccggccggt 54 60 
gtccccgcgc gacgcggtct cggtgatcat 5520 
gcacctgctg cacaaccaga gcgccctgcg 5580 
cgtcatggtc acgtcggtcc cgtaccagct 5640 
gcgcaagcgc caggactgga gcgcgccggg 5700 
ccaggaccgt gaccgtacga aggactcctc 5760 
cctgtcatcc ccccgttctc tggagtcacc 5820 
aggcggaaca gcgagcggtc gcggagccgg 5880 
agtacgaggt cgaggtcggc cccgtcgccc 5940 
agggccaggt gctgttcgtc cggcacacgc 6000 
cggagggcga ggagggtgcc cgcttcctgc 6060 
ccaaggaccg catcgaagcc ccctgcccct 6120 
actggcagca cgccaagccg ggcgcccagc 6180 
agttgcagcg cctggcgggt ctcaccccgg 6240 
cggccgaggg cgacaagctg ccggccggcc 6300 
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aggtcccgtc gtggcgcacg cgcgtgcagt 
tgcgccgcca ccgctcccac gagatcgagc 
gcgtcagcga actgggcatc gagcgccgtg 
tcgcggcgac gggctcccag gaccgccagg 
tccccatcgt cgaactggac cgcccggtct 
gcgtccaccg cgtccacggc cgccccttcg 
gcgtcggctc cggcggcttc tggcaggtcc 
cggtcatgca gggcctgctg ccccgcaagg 
tcggcctctt cgccggcgcc ctggccgacc 
tcgagtccgg caagcgcgcc gtcgaggacg 
tccgcatcga gcagggcaag gtcgagtccg 
acctcatcgt cctcgacccg ccccgcgccg 
cgaccctggg cgcccgcagg atcgcctacg 
acctggggta cttccaggac ggggggtacc 
tcccgatgac tgcgcacgtt gagtgcgtgg 
gacctgcatt tttcttggct ggatcaggag 
gcgcacgacg tagagcttgc ggaccgctcg 
cggtgccgcg atgtcggcgt ggtcggccct 
tgacgctcgg gtgacgctat ttctgaaggg 
ctctcgcgcc cgatgaagca ggttctctct 
ggatcaggta gacggtcagg gaccgcaggg 
atcagggcgc tgtcattggt gatggcgaag 
gcgttcagga gtctgtgcgg gcgccgcacc 
agctcgccac cagtccgaga ggagccgata 
gcgttgctgc ccccgctcgg cagaccgggg 
aacgggatcg tgtggagttt cggaccggaa 
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tcgcggtgga cgccgacggt cgcgccggtc 6360 
cgatcgacca ctgcatgatc gcggcggagg 6420 
actggcccgg catggcgacg gtcgaggcga 64 80 
tcatcctgac cccgcgcccc ggcgcccgcc 6540 
cggtcatgcg cgtcggggag aaggacggcg 6600 
tccgcgagcg cgccgacgac cgcacctacc 6660 
acccgaaggc cgccgacacc ctggtcaccg 6720 
gcgacatggc cctggacctc tactgcggcg 6780 
gcgtcgggga ccagggagcg gtcctcggca 6840 
cccgccacaa cctcgccgcc ttcgaccgcg 6900 
tcctgccccg caccggcatc gacgaggtcg 6960 
gcgccggccg caagacggtc cagcacctct 7020 
tggcctgcga cccggccgcg ctggcccggg 7080 
gggtgcggac gctgcgggtg ttcgatctgt 7140 
cgattttgga gcccgccgca aaggggctct 7200 
cggcctgttg cgctcgacct gttctccaaa 7260 
tgaaagccgc ctgacctggc gttgcacgag 7320 
tctcctggcg cgaagggaaa ccgaaggtct 7380 
tcgtcaccga ctggggaggc agggccctgc 7440 
gctccaggta atcgtcgagg gtgccctgac 7500 
tgcagggcgt cgacgcccag gtcgaggatc 7560 
gcgccgatca caccgtcgac ggaacaggtt 7620 
ggcacggtct ggaagctcgg ctccgaccgc 7680 
ctgtccggtg ccgggtgacc cttcgtgcaa 7740 
cagcaacgct tgcacgatcg gccggtactc 7800 
cggcttggca ggacgtgccc gagcggtacg 7860 
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gctcctgggc cacattgcac acccgcttcc 
gaaagggggt tccgcccccc cccgggacct 
ccggctccgc gctcccgcca ccaacctgac 
cgccgacgga ctgtccgacc aggccatcgg 
cgatatcacc tggcctgcat ctatgccaac 
gccactgtca ccgccatcga cgacctcggg 
cggtgtttcc gttctccacg acttcagcgg 
gtgcccgctc ctctgaaccc atgtgaacgc 
cggacctggc ctccgccgcg gccagaatgc 
acaggcgagg gcggtccctc ggagccactg 
actcggcggt caccgcggac gcggtgcggt 
gctcggccga gcgagttatt ccggcacccg 
gcttcgaaga ccgtgacgcg gttgtcgtcg 
gcgcggaatg cctccgacga ggtgtaggcg 
acggccaggt agccgtgtgc gcccttgcgg 
ggcacgttgg aaagggtggc gcgcatgctg 
gcgggcgcgg cgacggtgaa ctcggtgacg 
ttggtgtgat gtcggtggct gtcccgcccg 
tgcaccgcgt gtccgatcga gttccgttgc 
cgggtccgcg gtcggctgag ggagcctgcc 
agtcatgaac gggggtacgg cggccagcag 
cttcagccgg atcgcgacca ggacggtcag 
gagggtgccg aagatccgta cgccgagttc 
cccggccagc agacctgccc acgtgcacgc 
cagcagacgg ctggtgccgg tttcggcagc 
ttcggaggtg ggcgtggagg gtgtctgcgg 
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gtcgatgggt gaaaggcggc acctttcagt 7920 
tgcgcccacc gtcgccgacc ggctgatgaa 7980 
ccgacgtgag accgaagtcc tctcaccggt 8040 
cgcacgcctc cacttgaccg aaggcaccgt 8100 
ctcggaaccg actcgcgcac cgccgctgtg 8160 
ctcatccgcc gctgaacagt atgtggtggg 8220 
cgtccggagt tgtggtgctg gctgggcttg 8280 
ccacggccag ttcgagccgg acacgccccc 8340 
ccggcccctg cacctggtct gcctacctgt 8400 
cctgtaactc cgaggggccg cccttggccg 84 60 
caggaggcac cgtctgcgtc atcaggcgcg 8520 
tgggaccaga aatgtcagcc ctgcgtgacc 8580 
gagagcgagt gggtggggtc ggcgtgcgcg 8640 
gtgaatgcgg cgtcgtcctc gaagttgagg 8700 
ggacgcagca gccgtgcgtt gcgcagcccc 8760 
gcggtgcagt tgttctcgaa cgcgccctgg 8820 
gcggtgctca tttctgtctt ctctcggttg 8880 
gccggggccc gcacggcgat ggcgatgatg 8940 
ggggtgcggt tacagggctg gagttgggct 9000 
tgtgcggcgg gcccagattt cgaacgcgat 9060 
ggcgaagatc gttgtccggc ccagccgcca 9120 
ggacacgtag acgatgaagg cggcgccgtg 9180 
ggtggtttcg gggatgtact tgaggtacat 9240 
ttcgatgatc gcgatccagg tgaacgcgcg 9300 
ccgtgcggcc ggcgcgtcgg cggaaggcgg 9360 
ggtgcctggg cgtgcgggcc acagggcgcg 9420 
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gttgccgagc aggcggacga tggcggggac 
caggatgccg caggccatgg cgaagccgaa 
caggacggcg aaggtcgccg cgaggatgag 
tgtcagtgcg gcggtgatcg ccttcgctgg 
gctcatgatg aagatgttgt agtcgacgcc 
ggtgacgcgg ttgccgatgc cgtcgtcacc 
gcccagggtg gccaggaacg acaggagcag 
cgagcggagc agcaggacga ggaccacggt 
ggtcgtgtcg cggtcgaggg cggagcggat 
gagcaccgtg gcgtcctgga cgccggcggc 
gggaccgatc gcgtcgagtg ccttggagct 
gaagacggtc ttgccgtcct tgcccatgcg 
gtcggtgagc gcggtggcga tgtcggcggg 
atggacgacg acggtactgg gggcgatctc 
ctgtccgtgc tccgactcgg tggcggcgcg 
gtagccgagc agtcccgcgc tcagtaccac 
gacgggggcc cgtgcgacca gggcggcgat 
ttcggcggcc ttgtccacgc ccccgggcca 
ggcggggatg aaggtgaacg ccaccagcgc 
tccgaagccg tgaagtgccg gggagacggc 
ggtcgaggcg ctggcgagga cggactcggc 
ggcgcggtct ggctcgtcga gcagggtctc 
gtccgtgccc accccgaaca gcagcacggt 
gatgactccg gcgtccgcga gaatcgcgcc 
cacggcaaga agcggcacga gcgccatcag 
caggacgagc acgacggtgg ccagcagcag 
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caggagcggg cggatgagga aggtgtccag 9480 
ctggaacagt tcgcggatcg gctgggtcat 9540 
gcccgcggag gagatgacgc cgccggtgcg 9600 
gggctgggtg cgcagttcct gcttgaaccg 9660 
gagcgcgacg aggaagacga agatgtacgc 9720 
gaggacggtc acggtgaaga aggtggtggc 9780 
ggtcgcgacc aggtagagcg gggcaaggag 9840 
gacgatggct aggaccagca gcacgatgag 9900 
gtcggcgttc tgcgcggtct cgccgccgat 9960 
ctgggctgcg gattgtgtgg cctgcttgag 10020 
gtaggggtcg aggtcgagga tgacgtcgta 10080 
ggggtctgcg acacggctga cgtgatcggc 1014 0 
tgcggggctg gagcgcaggt tgtcctggga 10200 
gccgggcccg aattcctccc gaatgaggtg 10260 
gaagccgctg agggtgttga agctctcctg 10320 
caggagtgcg atcacggccg aggccacctt 10380 
gcggtgccag atgcctgcgc cgcgactgcg 10440 
gaagacgctc ctgcccagca ggaggaccag 10500 
catgacggcc acgcccagag cgaggtacgg 10560 
cacgagcagg gcaaacatgg cgagcacgat 10620 
ggtgcggcgc acggcggcct gcatcgcgcg 10680 
gcggtagcgg gcggtgatga tcagcgcgta 1074 0 
catgatcgag gcggtctggg agctgaccgt 10800 
gagagtctcc gccacgcgca tagccacgcc 10860 
gggcgagcgg tagatcgcca gcaggatgat 10920 
gactttgtca ccgccgctga agaccttcac 10980 
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ggtgtcggtg gcgatcccgg cggggccggt 
ggacgcgagg gcacgcacct cgtcgaccgc 
ctccatgggc acgatgacca gctgagcacc 
gtcgggagcg gtcaccgtgg agaccacgct 
aagggccgag gtgatggcgg cgaccgattg 
gccgcggacc acgatgatcg ccggcgtcgc 
atcacgggcc ttcatggagt ccgaggcggc 
gacggattcc agggccgggg cgaccccggc 
caccaccacg gcggcgcgct tcttcgatcc 
catcggttgc atcgggcagc cttcggcagg 
ctctaagttg cgtcaagcta acacgccccc 
gggagacggc acagcgtccg aggtgaagcg 
gtccgggagc accgggcgca gcggctcgcg 
gaagagggcg gtgccgaagc cctcacagcc 
cgcaacagca tctaccgcta cttcaactcc 
cgcgaattcc ccgcctggat cgacgcagtg. 
gccgcccagg ctgccgccta cgtcagggcc 
ggctggcggg ccgcgctcac gcgcgactcg 
aatctgcaca tctcgctaca cgaggcgctc 
cagcccgagc tgaccgtggc ggtggtccaa 
gaccaaggcg acgatctgac aaccgtgtcc 
ctcgcggatg acgacttgcc acatcacccg 
ccgtcgacag cgaagccccg gcagaacgag 
cagaccgctg gtggctcatg cctcgtctca 
cagtgcccga cgcccatcga ttacgacgtc 
gacgttcgtg gtgcgctcat tggtcacccg 
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caccgcgacg tcggcgggcc cggcccggtc 11040 
attctggaag gactcgtccg aggggctgcc 11100 
gcggtcctgc gagaccaact cggccgcagc 11160 
cacgacatgg tcgggtcggc tggttccgga 11220 
cgtggcgctc ttcgcggcgt cggtgccctt 11280 
gtcctggccc ggaagctggg cgcggacgag 11340 
gggcggcagg ttggcggagg cgttgtcctc 11400 
gaggaggccc gcgatcagga cccagaaggc 114 60 
caggagacat cgcagcagag cgggggagtt 11520 
aagtacggac agaacttagc gacagggtgt 11580 
tcggcctctc gggcgtgggg gtaggttggc 11640 
gagaaaatgc ccaagattga agccggcagc 11700 
cagctgattg acgcggccga ggagctcctg 117 60 
ggagcggttg ccgcgcgagc cgggatcgcc 11820 
atcgacgacc tgctcgaact cgtcgtcacc 11880 
gagcaggcca tcgcggccga gaccacaccc 11940 
aacctcgaac aggcagctcg cggcacccac 12000 
ctctccccgt cggcgcggga gcgggtgagg 12060 
gcccgggtcg tgcgcgaact ggggcagcca 12120 
gcagtcgtcg atgcgtgcat ccgcagaatc 12180 
gacttcgcgg ccggagcgac gcgtcgactg 12240 
tgacgcaccc cgtccaggcg gctcgcaggc 12300 
ccggatcttg agccgcaccg gagcgtgacg 12360 
tccgatcttg ccaccgggcg gccgaccggt 12420 
cacgacccga accagcgcgt tcagtgcgtt 12480 
gcctctgggg gtcaccagcg cttttagggc 12540 
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acgagactcg acggtggcgc gtgataccag 
tccggcttcc cgacgacctg gacacgaagc 
gcaagcagga acttgccatc ggggccattc 
tcgatgacgt tctggccggt ctgatggaca 
gagcggcgtg cgctacctcc agatcgacga 
tgccgagcac agcgtgcgtg acatgggcct 
gaacgtcttc ggagccgagc tgtatcccac 
tccgtcgccc gcaatcacgc gctgatcgac 
cgcgtcttcc tgcggttcaa cggcgccagc 
cccgacggac ccgaggcccg tcacgcgctg 
ctgggaccgg cgctgctgat cgccctgtcc 
ttgtgggtct cggtggtcag tgaggtggcg 
ctgcgtgtcg gcgccgggcg tctgggagcc 
ggtcttggcg tggtcgctgt caaactcatc 
tcgcggagcg catacggtgg cagaccggtc 
ccggcacccc ggcgaacagc cgcacgtccc 
atccaggtca gtcgccacag gcgcagaagc 
cccacgtcct ccccggacac cacgaagtcc 
tcgcgggcga cgatccgctg aagggcttcg 
aagcgctgac tgctcgtgat cagccggtcg 
gtgccgtccg tgccgacgag aggaccgccg 
cagccgacgg cgtaggcgtc gagctctccc 
tcatatcctc tgagggtcac cttgaacagc 
gcagcctcct cgtgggctga tccgcgtccc 
atttcctcac cggcctcgac gaggaggcgg 
ttgtcgaaga aggaggtggt gtggcccgtg 



PCT/US99/01313 

-95- 

gcaggcatca tgaccttatg gcgatgacac 12600 
ttacggagcg ggctcgtggg gagggttgca 12660 
gtgatgcccg ggaccgggcc gagctgaagg 12720 
gcgatgcgga gattctggac tacctgaagt 12780 
gatcctggcc atcgtgcgca cggtcaacgg 12840 
ccttgtgtcg gcgatcgaac ggccccggac 12900 
cctgcacgag aagccgcggc actactgcac 12960 
ggcaacaagc gcaccgcctg gttcgccatg 13020 
gccagtaccg tcccgcccca cgggcgccgg 13080 
ctcaccagca gccctctcct cagcagcgca 13140 
gccctggggg ttctcgccct ggacacggcg 13200 
gcgccggccc ggtggggctt cgtgggcggg 13260 
ctgatcgccg gcgtactcaa cgccgtgatc 13320 
gccgggcact gagagggcct gtggtggtgt 13380 
ggaatcctcg gcgccgcggc cggagcggtc 13440 
cgtccggtcg ggtcaggtcc gagccgtcag 13500 
ccggtgccgt ccaccgcgta ctggccgccg 13560 
ttggcctgcc acagcgggac gacgggcacg 13620 
aggtcggctt cggcgtcgct ccggtcggcg 13680 
gcggccttgc tgccgtaccc cgtcgccatg 13740 
aaggtgtcgg gatcggggta gtcggcgacc 13800 
tcggcccagc tcttdtggaa ttcgtcccat 13860 
ccgtcggcct ctagttgctt tttcacctcc 13920 
gccgcgtaac cgtaggtgaa agacagcggg 13980 
cgtgcctttt cggcgtcctt gtgagggtag 14040 
atgctcgtcg ggatgaggga gtagagcggg 14100 
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tccacggttc cgtcgtagac gtcgtaggaa 
gcctgccgtg cgcgtctgtc gtgaaacggc 
cgagtctccg cgctctgcgc ctccgtcacg 
gcgagcattt cggggggaag ctgtctgagg 
cgggcgagtg agtcgggggt gtcgtagaag 
gcggcgccct tgtaccgagg gttgggcgag 
acgacgccgt acggtccggt cccgtcgatc 
tacgtggtcg agtcgacgat cgagcccgcc 
tccggtgcgg tcagtcggaa agtgacggtc 
tccaggaggg acgacggccc cacgtcggaa 
cggacgtcct tggctgtcat tctgcgtccg 
catcgatagg tgcgtaggcc ggaatcggtg 
ggctccgcca ctccgggctc cagggtcagc 
gtcgagccgg agtcgtagcc gccggccggg 
accttgatgg tggtgccctc ctggtcgtcc 
gccgtggcga tcaccgtggg tgtgatgacg 
atgaggctcg tcctcgcaag atcgagacga 
tgcgtcggcc aagtatgcgc aggcgtcgct 
gaagtcttat gactgacacg gtggaactgc 
ggtcggcacc ccctctcagc agccgttccg 
tgcgcgggtc ccgcccacgg tctcactcct 
gccatgttcc gtgttgcagg gcgtggccat 
aggtggcagt gcggtactcc gtttcccacg 
cacggccgga tccctcatgg gaggcgctgt 
ggcggcgctg ccgctggccc tgaccgtcgc 
cggcgcagcc gggaacaccg ggtccgtcgt 
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atccggtccc tgtctatcag ccaggccgcc 14160 
ttgccgcggc ggttgttgag gtacaggttt 14220 
cgaagccccg gatcgctcgg gttcagatcg 14280 
gcgacatcga tgcggtggga tatccaggcc 14340 
tggagttcga tcggccggcc ggtgttctcg 14400 
agggagatct tctcgccctt cgcgtaggag 14460 
cggccgtccg agcgcaggga gtccgccggg 14520 
ccggtcgtca gcttgaaggg gaacgtggcg 14580 
cggtcgcggg cgtccatcga ctcgatggtg 14 640 
tctatcttct tgacccgttc gaacgagaac 14700 
ctggagaagg tgatgtcatc ccgcagccgg 14760 
aaggagcagc tttcggctgc gtcgggaacg 14820 
agtgtctgga agacattgct gtacagagtg 14880 
tcgagtgacg tcggcggttc cgtcgtcccg 14 940 
gtcgggtaaa gcagcagacc ggctgccagt 15000 
gacgcacgga tgtgcgcacg aataggtctc 15060 
acaggaattt tcgtacccct gggtggagag 15120 
tccttcggag cccgacggca cttccggaac 15180 
tatgccccgt tcggcgagag ggccgccagg 15240 
cctcgtctcc ggtggtcctg cggacccgct 15300 
cgatgccatt ccctgtgcaa tgtcacctgt 15360 
gccaagtcgg gaggtcgttc gtcttccgtc 15420 
tcctctcccc cttcagtcgg ccgtgctccg. 15480 
gagaaagtca ctggtacggc gaggtctggg 15540 
catgagcgtg ggcctgctgt cgcagccggc 15600 
gcacgtcgcg gcggacgacc cggagcacgc 15660 
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gggacccccg cccgtcgcgc agtcccccac 
cacgagggcg tccgagcttc cgcccgtggc 
cggcaagacc gcgaaggcgc cggtccgtcc 
caagaccggc aactcccgtg cgcgtgccgc 
gagcggcggc gcgctggtcc agcagatcaa 
gttcaacctg accgggaacg acgcctacta 
cgcctacgcc ctgcgcgacg gctcgacgtc 
gcagctcgtg ctctacctgc gcgccggcta 
gggcacctac ggcagcagcc tgcagaccgc 
cagcccgcac tcccgcgacg tcaacgacgc 
gctcatcgac agcgccgagg agaacgcccg 
ggactacgac tccacctgga actcgtcgtg 
cacggtgacc ttccgcggtc accaggtgcc 
cggcctgatc gacgcgctct acaacttcgc 
ccagtcctac ctcacgtcga acgcgggacg 
actgcgctcc aaggtcagcc ctctggccgg 
ccggacggcc ccgctgtggg tcggtgtcgc 
ctgctcctac tacggcacct gcgacctcca 
gacctaccca tgcagctcca gcatcaccat 
gtcctccagc tgcagcagcc tgcgcaacca 
caacggcccc gtcgcgaacg acaacaacag 
caccgactac cagacctacg ccggcgcgat 
gtacctggag gggaatccgt cggcggccgg 
cgagtggctg cgtccggact tccagatctg 
cgacggccgc ttcgacatgt acggcgactt 
gtgggtcgaa ggcttcgccg agtacgtctc 
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cgccgagacg gagcacgtcg cgcagggacg 15720 
cgcgagtaag gacgcgctca aggaggtgta 15780 
ctcgaagtcg acggacaagg cggtcgccgg 15840 
cgcgtgcaac gtctccgact tcaccagccg 15900 
ggcgtccacg accgactgcg tcaacaccct 15960 
cgccttccgt gagtcgcaga tgacctcggt 16020 
ctacccgggc aacgcctcca ccggtatgcc 16080 
ctacgtgcac tactacaacg ccggcacggt 16140 
gatacgcgcc gggatcgacg ccttcttcgc 16200 
caacggcgag acgctcgccg aggccgtcac 16260 
ctacatccac gtcgtcaagc gactgctggc 16320 
gtggatgctc aacgcggtca acaacgtgta 16380 
cgcgttcgtg agtgccgtgc agtctgaccc 16440 
gagcggccac ctcgcgctgc tgggaacgga 16500 
tgaactcggc cggttcctgc agcattccgc 16560 
cggcctgctc aactccagct ccatcaaggg 16620 
cgagatgacc gactactacg acaaggccaa 16680 
ggcacaactg gcccgctccg tcctgacggt 16740 
caaggcgcag cagatgacct cgggcgagct 16800 
ggacgcctac ttccacaacg tggtccgtga 16860 
caccatcgag gtcgtggtct tcgactccag 16920 
gtacgggatc gacaccaaca acggcggcat 16980 
caaccagccg cgcttcatcg cctacgaggc 17040 
gaacctcaac cacgagtaca cccactacct 17100 
caacgccaac atcaccaccc cgaccatctg 17160 
ctactcctac cgcggcgtcc cctacaccga 17220 
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ggccacgacc gaggcggggc gtcgcacgta 
cagccacgac accacgcgca tctaccgctg 
aaaccaccgc gccgacatgg acaccgtcct 
cgcccgcagc tacctgaccg gcaccatcgg 
gctggcggcc tgcgcggccg gcaactgcgg 
ggcgcccacc gccgcgttca ccaccgccgt 
gtccaccgac gccgacggca ccatcgcctc 
ctccacggcc accaaccccg tcaagacgta 
gaccgtcacc gacgacaagg gagccaccgc 
cggcggaggc ggcggcaccg aatgcaacgg 
ccaacgcggc aaccagtccg ccaccaccgg 
ggccggcacc acccagctga agatcaccac 
ctacagcacc agcggctggc ccggcaccac 
caacaaccac accctgacca tcaccaaccc 
cgccgtcagc agcttcagcg gcgtcaccgt 
ccaaggcacg accctcacga cggcccgggg 
gcggcggcag gggggagacc tccgtcgccc 
cgggcatccc tacctcccag gaggcagagc 
cgcgccatgt ggtccctcat catgtccgtc 
gtcggcagcg gtgaccaggg cacgtcacgg 
caactcgcag cggaccagca catctccacc 
gagcggctca ccggcgtcgc aacagcgctg 
gcctggatcg accagaagca cggcggcagg 
acggccctcg tcgaggcccg gtccgctcag 
gtcgaccgca gcctgcggca actcgaccgc 
gcagcgaaca agggcgccgc ccacggcctg 
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cgcgctgagc accctgttcg acaccacgta 17280 
gggctacctc gccgtgcggt acatgctcga 17340 
cagccactac cgcgcgggaa actggaacgc 17400 
cacccgctac gacaacgact ggtacacctg 17460 
tggcgggggc accaacccgc ccgggaacca 17520 
ccagggcctg aacgtcacct tcaccgacca 17580 
ccgctcctgg agcttcggcg acggcaccac 17640 
cgggtcggcc gggtcctaca cggtcaagct 17700 
caccgccacg aggacggtca ccgtcggcag 17760 
gaccgacacc cgggaactgg gccagaactg 17820 
caactacgcc tacctgtacc tctacgtccc 17880 
ctccggcggg acgggcgacg cggacctgta 17 940 
gagctacacg cagcgggcca cgggagccgg 18000 
gccggccggc gccaactaca tcagcctgca 18060 
gagttccgcc tactgaccca cggctccgca 18120 
cggctctccc cgccccgggc ggcgtccggg 18180 
cggaccgaga acacatcgcc cgcccgcaca 18240 
gtgaagtcat tacccgcacg caggcgacgc 18300 
ggtctcacct gcgcactcgc cacacccgcc 18360 
ctcagcgcct cgcaacaggc cgcggccggc 18420 
caggaggcac agcggcgcgt actgcggcag 18480 
cgtgagcgcc tgggttcccg cttcgcagga 18540 
ctgaccgtcg ccgtcacccg gtcgacggcc 18600 
gctcaggcac ccgacacgac caccgtcgtc 18660 
atgtccgcag gactggccca ccgtatcgcc 18720 
cagtccgcgg tggtggtgca ggacaacaag 18780 
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gttcgtctgg acctgccacg gggcaagacc 
tgggcgaagc ggaccctcgg cgatggcctc 
cccttctact gcggcggcca gtactcgtgc 
tacggcacga acgtccgctg ctccagcgcc 
atgatgaccg ccggccactg tgcggaggac 
tacggctacc agggggtcgg tcacgtcgcc 
gcgatcgtca gggtcgacga ccccggcttc 
acccgcatca ccaactggga ctacgactac 
acgaccggct acacctgcgg gcagatcacc 
cgcaccctga ccggcatgac ctggtccacc 
ggcgtctacg acggctcaac ggcccacggc 
ggcatgatcc acgaaccgat cagccgagca 
ggctaagcag cccgggcgga ccgtgagtac 
ccgccgcacg cgcggtcggc gtctttcccc 
tgggggcggg ggcgtggcac gtcgtgcacg 
agaccgtacc ggggagcgcc tttccggctc 
gcgtcaccca tttctggaag tgcggagcct 
catgatggtg caccacctcg acggccgacg 
gatgccgctg ccacagcgga tcccccgcga 
gcatggtggg ctggacggcg gccaccccca 
cgagcatgtg gaactcgtgg gtgacggcgg 
agaaggcgcc gggccagccc accccgtcgt 
ggtcggccaa ggacacctcc agccggtgcg 
acaccgggac ggttctgata gctcggtggt 
ctgggtagtc gcaacccagg gcgacgtcga 
ctccggtcgc gtacacactg ctgaccgaga 
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ctcacccccg cccagcacgc agtcgtggag 18840 
gaggtcagca cctacgcgca tgcctccgaa 18 900 
gaccccccgc tgcgctcggg cctggccatc 18 960 
ttcatggcgt acagcggcag cagctactac 19020 
agctcgtact gggaggtccc cacctacagc 19080 
gactacacct tcggctacta cggcgactcc 19140 
tggcagccgc gcggctgggt ctacccctcg 19200 
gtcggccagt acgtgtgcaa gcagggctcc 19260 
gagaccaacg caacggtgtc ctacccaggc 19320 
gcatgcgacg ctcccggtga cagcggcagc 19380 
atcctcagcg gggggccgaa cagcggatgc 19440 
ctggcggacc gcggggtcac gctgctggcc 19500 
gccgccccgg tcacatcacg aggacgtcga 19560 
gtgctccgct ccgtccgcca cccagcggac 19620 
ccgcagcgcg gtggaacccg tcggccgatt 19680 
cgttcgtggg acgggcgggt gcgtatgcgc 19740 
gcgacagcag ttgccagtgg gcgcgtacgg 19800 
cctcgaccga atcccgccgc cagacgagca 198 60 
ggggtttaac cagtactccg cccaccgggc 19920 
gacccttggc gatcatcgac tgcagttggt 19980 
gcctgaatcc cgcggcccca caagcgtcgt 20040 
ccgcggagac gaaccacgcg tcctccgaca 20100 
ccagtgggtg atcggcaggg gtggccacga 20160 
ccagcttcgg agagtgtcga agaggcagcc 20220 
gctcgccggc ctctaggaga tcgatgagtt 20280 
cggtcagatc ggggcaggct tcacggagga 20340 



WO 99/37660 



PCT/US99/01313 



cgtcgagcaa ggtgggtacc accggtgtgt 
ccccggacga gcggggaggc cgcagccgtg 
gggcccggcc gacgacctgg gcgccgtagg 
ggaagacccc ctcgccgagc agtccctcga 
gggtgtatcc gagcgccgca gcagcccggc 
gcacgcgcaa gtggcgcagc tcaagttcca 
tccattatgc gccaggagga aggcggtggg 
accagtagcc cgagggttat ggatgagccg 
ccgggtgacc gccgtggaaa tctcggacct 
gcctgaagcg caccgccatc gaggagcgac 
gtcgaagcga ccggggccgc ggttcaccgc 
tggtggggag ggccggtcga ggaagagctc 
ctcctcacca acgggctgcg gcacgcgggc 
cacgacatgg tcgtcgtcgc ggtcctcgat 
acggaggcgg acgacgagtg cgggcgggga 
cggggagtcg agaccacttc ccgcgggaag 
ccacaggagc gggctatcga gtcggctccg 
gaccgggaac gctggtcact ggctccccaa 
gcgatgtgag ttcgtcctcc tcgggcggcc 
ctgagggcgt gatgacgctc gtctgacgct 
agaaacagcc tttgacctgg ccttttctgc 
cgcaccacgt ggagtacgtg gcgattctgg 
ttttgttctc ctgcggcggg cgcggcaagc 
cggtgctcgt gtgtccggcc ggcgggtggg 
actcggacgc ctgagccgag atgcggttcg 
ggtcagcgac cgctggcacc tgtggaagaa 
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tgatggcccc gaggcgaagc cgacgtgtcg 20400 
cgagattgtc ggagagcgcc aggatctccc 204 60 
cggtgagctc cacgcccgcg ctgctgcgca 20520 
tgcggcgcag ttgggtactc atcgccggct 20580 
cgacgccccc cgcgtcggct atcgcacaca 20640 
cgggggcacc tcgctccggg cgaacagagt 20700 
gaatccggga cggcctgacg ccttcggtcg 20760 
gagcctctgg tatggcctgg ccggttgttc 20820 
gcgtgttggt ccgcagaggc gactgcggaa 20880 
atcatgcctc acacctgcat cagcttcacc 20940 
gcccgccacc gcgtctccac cgcgctgagc 21000 
cgcttcagcg cggaactcgt gacctccgag 21060 
gggcccatga ccgtcgagtt gacgctggtg 21120 
gacagccggg agctgccgcg gcctcggcag 21180 
ctcgccctga tcgaggacct cagtctgata 21240 
cgctgctggg cggttctgcc gctgcggacg 21300 
gctgaggagg cggaccacgg cttcgaggca 21360 
ggaagcggac tactggcgag tctgtttccg 21420 
cagtagccga cccagggcag gcgggcgtgc 21480 
ctggccgctt tcaagctgca cagcgagccg 21540 
ggctgcctca ggccgacatc tttccgatga 21600 
agcctgctgg caaggggttc tgacctgcgc 21660 
tcgtgcgggg cagttgggtt tcccgaaggc 21720 
ctgccttcgt ttcagtgggt gcgagagggc 21780 
ttcggcacca tggggtccgc aggatgaccc 21840 
cctttgcgac aaggccctgg ccgaggttcg 21900 
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ctcccacagc gcctgctgga ccacagcgaa cacaccccgc ccggtcggcg tccatgagca 21960 

gaccacccgc gaacgttggc atcagctcca cgacctcctc ggcaagggtg tcggcttgct 22020 

cgaatgcgcc cgccgcctga acctgtccct caacaccgtc aagcgctacc cgcgcacccg 22080 

cgatcctgaa gccctgcgcc ccgtgaagca gctgtttcgc gaggtccagg agcagggctg 22140 

caccggcagc ttcaccctgc tctaccgcag cacccagggc cgggcagaag gcgaccggcc 22200 

cgtcggaggg tcgcggcttg acgctcaccg tatccatcac tggaacggcg acgtctgatc 22260 

ccgtctgccc ggggcttggg tcccggctgc ggcccgtagg cccggctcac cccagcaccc 22320 

atcactgttc gagagtgatt acctctccgc cggacacatg gaaatctgca tcggctggag 22380 

tagacattgg gcagcagtgt ggttatgttt ctcctgtaac ccagaaggac cgcagggccc 22440 

ggcagagacg aactgccggg cagcagtacc cgcagttgca ggacggtgcg gtggtggagt 22500 

gtcgaagcca ggatggtgca ggacggcgac gggactgacg accggaccgg gcggcccgca 22560 

gtggtcaggg gccgccaccg cagtgcagta cccagcagcg aagtcagtga gcggtacctc 22620 

ggtgaaggcg tcggctgcgg acgcgcgcgc cgggaggttc ggcagtggtg gttccaagcc 22680 

agagcagacg caggacgggc aacggggccg actgtcggac agtggcgctg tcacaggtca 22740 

ctgagaggtt cgtgtcacca gcagtagagc agtaccagag -gaaagaacgg aggaaccaag 22800 

cgccatcagg atcgcccggg cgcagttttg ggcccgggta ccgcaggaca tcgatagtga 22860 

ggtggtctcc ggtcaagaaa ccgcgatccc cgcgcccccg gcagcaggca ggtcgggtcc 22920 

gcggacacag aaggccggtg cagtatcagg gccggcagat ggtgtaggag ttccttcggg 22980 

gccctggtgc cgcatggcac cagggcccct ccatgcgttc cgcagagagg tgcagatgac 2304 0 

agcagacgat tcgtacggcc gtctcgacga cgacgattac cccgcctaca ccatggggcg 23100 

ggcggccgag atgctcggta cgacccccgc tttcctgcgg gccgtcggag aagcccggct 23160 

gatcacgccg ctccgctcgg agggcggcca ccgccgctac tcccgctacc agttgcgcat 23220 

cgcggcccgc gcccgcgaac tcgtcgacca gggcactccc gtcgaggcgg cctgccgcat 23280 

cgtcatcctg gaagaccagc tccaagaagc gcggcgtatc aacgaggaac tgcagaggcg 23340 

cccggccggc ctggtggaca aggccgaggg ctgaggccgc atctgccggc cggtcctgtg 23400 

agggctcgcc tgccaagacg ggaagccctt gccgcaacga gaagaggcaa ctgtccgcac 234 60 
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cgatgtgctg ggcccggtcc tggctaggac tcccgtcttc ttgccggagc gatgcggctg 23520 
tggacgcgga accggacggc agtgtcgtcg ggcgcggagc gcggggcgca cgtcgatggc 23580 
gacaggaccg gcgaaggtgt attcgtgttc ggcggtgtga cggcgcacct ggccggcgag 23640 
ggcggcggcg caggtgtcac agggacatcg gttccgactt ccaccacccg tccgggttcc 23700 
accagcgtgt catccacctg atccaggctg ccgcggtagg tgctcgacgt cgggggtgta 23760 
cgggggcagt tgtaccgttc cgccgcgagg agtgacccga ttgaccaccg gcctgtggcg 23820 
ctcaggaacg ggctggactg tcgcagtccg ggccaactca agcccgacca tgaggccgac 23880 
cacggcgccg cgcgaccccg accacagcta cacgcgtggc atgaccaagg cggcacatgc 23940 
ttcgaacgag ccatctcatg tgtgccggta tgaacgtgat cgacgtcccc ggcactctgg 24000 
tgcggacgca agccgtctgg ggcgccaccc acgactggct cgccgccccg cccccgcggc 24060 
gccaccgtcc gtcccgccct gcgtcgtcgc ccgtgtggcg tcatgacggc gacagacttc 24120 
ctcgcgtatg ggccgaccat acggccaacg ccagaggtaa agcgctgtcc atggtgagtt 24180 
ccctgaacag aagggctggc gggacctcct ttccaagacc gtgctgcagg agtccgtcag 2424 0 
agcgcaggta atcccgtgct gtccgcgacc cagggctgtc ctccgtctgg ccgagggtcc 24300 
tcgtcttctg ggcgacatcc ctttagcgtg ggcggtagcc gccgaaggga ggcgccatgt 24360 
cggacgaatt gacgggcccg ttgggaacgg caatgcggga ggtcacgttt ccggaccggt 24 420 
ctcgcgggat catcttggtg cgggctggaa caccgcaggc cgaggccgag gcaatggccg 24480 
cccgtatgtg ggccgagatg ccggaaggct gacgtgcccg aacgcagaca acccgtaccg 2454 0 
tcctcacacg cattcccctg agccgtcggc catggaacgg aaccagccgt acgaaccccg 24 600 
gaggcgccgt tgcggtctct gcggcgaggc cggggccacg cagggcgaag aggccgcgcc 24 660 
gcgttctgcc gcctggcgcg gctgccggct gttcacgaga acaccgaggg aggagtcgcc 24720 
cgcctcttgc ccggcgcgtt gccgggtgga gagcaggtgg tgaaggactg gctcgctgaa 24780 
ggcggccgag gcgacctcgt cggccggcct gaacggcttt cactgtccca gcggcggcag 24840 
gccgccgaca caggcatgct ttgccatctc cctcgctgtc tactgacccc agcagcagga 24 900 
tccagtacgg cgtcgcggcg ctgccgcctc actcgcgcat cgatcgggga atgcggcatg 24 960 
tggtgagggc ccggccggcg tgccggtcgg gccctcacac tgttttggtg tcggcgcgtt 25020 
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tgtcgtgtcg gtcagacgga caggtggggg 
ggactgtcgc tgcgcgcggg ggcggcctcg 
gccatggccc ttaggatttc gccggtgctg 
tggccgacct gcttggcggg gtagtggcgt 
ggttcgggtt tgacgccctt catcgattcc 
aagcgggcaa tgacacagcg catgatgcct 
cgtctggcgg ttcagcggga gagagcgagg 
acgggcacca gcccgagccg tccgaagggc 
gacggtgaga cgaagggctc gctgtcgtcg 
tatcgggagc tgtcccggac ggcggtgagc 
gcatcctcgt cgcagacgac cagatgctcg 
acctcgacgg tcatgtcgta ccagacctgt 
gtgccgcgca atgggagagc gcctacggag 
tgcctcctgc gcagacgggc ggggttcctg 
cgtggacttg agtgcggggg tacgccgcgt 
ggtggcgccg cgcttcttgg ggcgttcagt 
ggtcggggcc tgggctccgg tgatccggct 
ggtctgcggc cggatcccgg cttccgacat 
ggtgacgagc gtgacgacgc tgccggactc 
gaggtagtcc ttgtggtcgg tcggcgggtc 
gtggattccg cgtgccgcga cgttggtcgc 
ctgcgccaga gtgcgggtgc gctgcggctg 
taccccgctg ttgagcaggt cccgggtcag 
catgatcacg cggccgtcgc gtgcggcgat 
gtggacatgg agtacgtggt gctccatcgt 
gtgcacgacg gggtcgctga ggtagcggcg 
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gcgccgagca tggcggaagc ccgctgcaac 25080 
ggaagggtgc ggcaggtgaa gcccagctgg 25140 
aagtcgcggc ggtcctggcg ggtgatgacc 25200 
cgtccgatga tcacggactc gccggtgacc 25260 
agcacgccgc tcttggtcag gtcgaacggg 25320 
cacaggcagg agagttacgg ggccggccgc 25380 
acgcccaggg cgctgccgtg ttcgtcgacc 25440 
accgcgtcct cggcttcctc cctcgtggcc 25500 
gtgatgtcac cgaggcggag ccggtcggtg 25560 
cgggcctggg tgaccaggcc gacgcaccgg 25620 
gcacgggcgg cggccatcac ggacagcgcc 25680 
ggcccggcgg cgtccatgac gtcggccacc 25740 
cgatcctgca actgtcctgg cgtcaagggg 25800 
atcaggacgg tcctaggcgg ccgcgccagc 25860 
cgccgaggcg gggcggcgtc ggccgcgtga 25920 
cgccggggcg gtgatgacga ccgggatgcc 25980 
gagggcctcg tcgcccgggc tgacctgggt 26040 
gagacggacc atgccgcggc gctggttcgg 26100 
gccggcgcgg gccgtgcggc cgccccggtg 26160 
gacgttgacg acgaggtcga ggttgtcgac 26220 
caccagcacg gtgacgtgcc cggtcttgaa 26280 
ggacttgccg ccgtgcaggg cggcggcccg 26340 
tctgtcgacg gcgtgcttgg tgtcgaggaa 26400 
ctcggtggtg gccgcgtgct tgtcggcgcc 26460 
ggtgacggcg ccggccgagg ggtcgacgga 26520 
tacgagcagg tcgacgttgc ggtcgagggt 26580 



WO 99/37660 



-104- 



PCT/US99/01313 



ggcggagaac agcatgcgct ggccttcggg 
cggcatgaag cccatatcgg ccatctggtc 
gttcaaccgg cagtcgccgc ggtcgatgag 
gacctcggcg ccaccacgca gcgccgacgc 
cgtggccagc cgcagcttca cagagcggge 
cgccagctca cgtgtcggta cgaggaccag 
gccggccgta cgggccagca gagccaggcc 
cccgcggccc atgatgtcgc ggccggcgag 
gaacggcacg gtcacccctt gttggccgag 
atcggcgaag ccctccgcag cgggaagcgc 
cccctgaacg gcgccgggcc ggcggccgta 
gcgcggcgcc ggcgaaccga agcggctgcc 
ggtgcgagcg aagcggtcgt tcgtgcgtgt 
cacatatcaa ggaatttccg aagcaatgag 
gggccttgcc agcggatctg gccgacagaa 
gggggtgctg tgggctcgat attcgaagcg 
ctgcaccttc gaaggacgat ccgtgtgcgg 
aggtgaaatc actgcgggaa acgcatgtag 
ccagctacaa gtacgtgaca gtcggcgtca 
cttctggccc tgcgcgatgt cgaagttcac 
ctgggcggcg atgttcgagt agtgggcgaa 
gaagccgaag cccttttccg cgttgaacca 
ccttcggggc agtcgtacgg gatccgcacc 
cccgcccgga aaaagaccgg agatgtaaaa 
gcacttgaaa tttcgggaac cacaactgca 
ccactgtgcg gtgaagaacg ccaccttgct 



acgcacctgg tcgagcagtg cggtgacctg 26640 
ggcctcgtcg aggacggtga cggagacctg 26700 
gtccttgaga cgtcccggag tggcgacgac 26760 
ctgcctgccg atcgacatcc cgcccaccac 26820 
gtacggggtg agcgcgtcgg tgacctgctg 26880 
ccccagcggc tgccgaggct cggcccgccg 26940 
gaaggcgagg gtctttccgg aaccggtgcg 27000 
ggagttcggc agggtcgcgg cctggatcgg 27060 
cgcggccagc agttccccgg gcatgtcgag 27120 
gggggtgatc gtccggggga gggcgaactc 27180 
accgccggag cggctgggtc cggccggccg 27240 
gccctttccg gagtcggcac cgccatgacg 27300 
gcggttcata cggaaccttc ctcgatgcgg 27360 
cagcacggag aatcgcaaga atggaccggt 27420 
aatctgtgcg gcacgtgcgc tggaatgatt 27480 
tccactgcac tgtagctatg aaggatgcgg 27540 
taaacacacg ctgtccggag cgtcgtccgc 27600 
ctggggcccg caccccgaag gatgcgggcc 27660 
ggcgggaacg atgttctcgg ccgtcgggcc 27720 
cttctggcct tcgagcagct cgcggaagcc 27780 
cacatcagcg ccgccaccgt cctgctcgat 27840 
cttcacggta ccagcagcca tgtcatttct 27900 
gcgcggacct cgtgtcgccg caatgatcac 27960 
gtgcttccag gggtactgag cccgaccgga 28020 
actgacatcg acagtagcac gccacagcag 28080 
tattgcggca gagaatctat ccgcatgctc 28140 
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cgatgaaaac 


tcaaaccgcg cgcacagata 


gccgcgctcg 


cgtgatccgg tcccccacca 


cgcgggggca 


gacaggtccg gccatgacgc 


ccggtcgggt 


gtacgtctcg cgcgcggcga 


gtcttgcctg 


gcaaccggct gtcggctcgg 


gttctgcgcg 


cccgcttccg tgaacgcgcc 


cctcaggagg 


cgtgacccgg ggaagccgag 


ttgccgccgt 


aggtgacgtc ctcgaagtat 


tccgtcaccc 


cgtatgccct ggccagttcc 


ttcgcgcggt 


ctgggtcggc ggccagcgcc 


tccgcgatcg 


cgaacgtcgg ctcttgggcg 


ccgaagtggg 


agagcatctg ctccgaacgc 


ccctcgtact 


ccgccagctc ctgagccagc 


gccaggtcgt 


agtagatgtt ctcgcggtag 


acttccacat 


gcagcggcgc gtcggagcgg 


atcacgtgcg 


agcgcgcgcc cagctccagg 


cagctcttct 


tcccgaacac cgaggtggcc 


acgagaatgt 


cgagccgctc gtactcccgg 


gcttcgtcga 


gatggtcggt gggaactgcg 


tcggcggtct 


cctcgatggt ctcggtcgtc 


gttcggccgg 


tcacatacac ggtagcgccg 


cccgccccgc 


gggtagcgcc cgccacgagg 


ccggcctgct 


cggtgttctc agtggtctgc 


atcgttcacg 


ctaaaaccga cagaacacgt 


tcccggccat 


agcgccaact acgtcctcgc 


gggtgacgcc 


ggtcaggtcg gggtaggggc 



ttgaccttcg cgcgacgcca tatatcgcat 28200 
cgctctccgc tactgcacgg gtcgcaccgc 28260 
cggccatgct cggggcgtag cggacgcctg 28320 
gcactgcggg ggaggggccg gttgccagac 28380 
gctggttggt cagccgtggc aggtgatgtg 2844 0 
gcagccccgg ctgccttcta ccaggccgac 28500 
gatcagcggt agtcgtcagg ggaggcttcc 28560 
gcccaggcat ccggccggct gccgtccacg 28 620 
ccgctggagg tggacttgcc gttccaccgc 28680 
gcgaccgtac gggccaggta gtgcggggac 28740 
atcgcgtcac gccagttctc ctcactcaca 28800 
aggaagcccg gggacaccgc gaccgccgtg 28860 
ccgaacgcga ggcggatcgg ggcgttcttc 28 920 
cggcggttgg agtgcgcggt accgtcggtg 28980 
atcagcagcg gaagcagcag cgccgccgtg 29040 
atccgcaggc cgtcggcgag cggtgtctcc 29100 
agaaggtgct cgccgcccca caggtcgttg 29160 
tcgatccgct cgacgagggc gcggacctgg 29220 
attccggtgc cgcccgctgc ggtgacgagt 29280 
cggccgacct cgctggcccg ggcccgggtg 29340 
gcccgcccca gttccacagc ctgagctcgt 29400 
gcgatccgtc ctgccagcgg acccttcgga 294 60 
ctggtgatgt cctcgttgct catgtcatcc 29520 
caccttttat gtggggggta ccgcgcatca 29580 
actgagcgtt ttcagcgtgg gccaccgatc 2964 0 
cgcaacgcac aaggctcgcg tgcacgacat 29700 
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ggccaccgcg cgcatgatct cccagcggga 
gagaccagct cacccgggac acccggtccg 
agacgcgttc agcagcccac atcccaggag 
ggaccatcgg gctcggcacc gggagtgcac 
cacggtaagc ggtgaggtgt cgaggtccag 
cggccgaccg gcatgccgag cgggcagccc 
ggcacgggca tgggcggcac ccccaccccg 
gtgacgggct tcgggtcaga cctgtgcggg 
caggttgatc tcgccgtccg tgtgccggac 
gatgctgagc tcgtactcga atcgcgctcc 
gagcacgttg acggcaccgt cgtgcggggc 
cgagcgcagg cccacggtgg cggacgaccc 
ctgaccggtc tcacccagtt cgacctgtac 
ctccaggagg ttgcagcggc cgatgaagcc 
gatctcggtc ggtgtgccca cctgctggag 
ggacagggac atggcctcga cctggtcgtg 
ccgctggagg tccttgagcc agacgcgggc 
gagcggttcg tccaggagca gcacgccggg 
gcgctgctgc tgtccgccgg agagctggtg 
gaccttggtg aggacgtcgt cgatgaggcg 
cttcagcggc agtgcgaggt tgtcggcgac 
gaagaccagg ccgagattgc ggccttcggg 
gaagacctgg tcgccgacac ggatggtgcc 
cgacaaggtg gtggacttgc cgcagcccga 
cgcgacggtg aagttgacgt cctccaggac 
ctcgacgacc agctcaggca tgcttcttcc 



gcccagccgt ccccggcagc cccagccgct 29760 
acaccgcaca cgatcaagta gtcgacctcc 29820 
ccgtctaccg tcccaggaac ccctgctccg 29880 
agttgatcag taactggcaa cgagctcgtg 29940 
atgggcggcg gcggtggtgc ccccagcggt 30000 
accggtgtgc cgagcggcgg acccggcggc 30060 
cagcacctga agtcggtcag gaccggccgc 30120 
gaacagcagg cagtcgtccg ggcggatgat 30180 
ggggctctcg gcatggacgc gcacgtcgcc 30240 
ggtgtacgag cactgctcga tcctcgcccg 30300 
gtcggcgcgg tcggtgagcg tgatgcgttc 30360 
cgcggagcag gcgccggcca ccctcaagcg 30420 
ggctccgccc tcggtggcgc cgacgcgccc 30480 
ggcgacctcg ggagtggcgg gagtctcgta 30540 
gtgtccgtgc atgaacacgg cgatgcggtc 30600 
ggtgacgtac acggtggtga tgccgacctc 30660 
ctggtcgcgc atcttcgcgt ccaggttgga 30720 
ggagtagacg atgcctcggg cgagggcgac 30780 
ggggtagcgg tcgcgcaggt gagccatgtc 30840 
ccgttgctcg cccttggtga ccttgcggag 30900 
ggtcatgtgt ggccagagcg cgtacgactg 30960 
gggcaccgtg ctgcgccggg tgccgtcgaa 31020 
cgagtcgggg gtctccagac ccgcgacgca 31080 
cgggccgagc agagtgaaga actccccgtc 31140 
cgcggtcccg tggaaggact tcttgatgtt 31200 
ccttcaggag gagaccggcg aggccggcga 31260 
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cgacggcggt gacggcgatc tggagggtgg cgagggcggc cacggagccg gtctcaccct 31320 
gggtccacag atcgatggcg gtggtgccga tgacctgtga ctcggctccg gcgaggaaca 31380 
tggcgggggc gtactcgcgg atcatctggg tccagatgag caggaacgag gcgagcatcg 31440 
cgggcacgag gagacggagc atgatccggg acaccgtgcg ccaccagtcg gcgccggcga 31500 
cgcgtgcggc gttgtcgagt tcggctccga gctgcatggt cgccggggag atcgcgccgt 31560 
acgccgacgg gagtgcccgg atgccgaagg cgatgatcag cgcgaagagc gtgccgcgca 31620 
ccgcgtcgcc gccgggtatc caggtgaagg cccagaacag gccgatgccg acgatcaggc 31680 
ccgggaccgc gtgcggtgac tgcgctgtcg tctccaggag acgggcgaag cggaagtcgg 31740 
agcggcgtgc cacgaggacg accaccgtgc cgaacagggt cacggccacc gcccccacga 31800 
aggccacggt gatgctgttg acgatcgact cggtgtaggg ggcgtagtcg aagatcagac 31860 
ggaagttgtc cagggtgagc aggtcgaacg ggttcaccag cggagtgagc agcgaggtga 31920 
acgcgcgcag gatgagcgcg- agcatcggca gcagtgcgcc gaagacgacg tacagaccga 31980 
cgaaggcgaa gcccagccac ttccaggcac cgatgtcgag caggtcggag cgggtcgcct 32040 
tgccgcgcac cgacacgaac cgctgggcgt gccccagcag ccgcgtctgg aacacgacca 32100 
gggcgatggt ggtgagcagc atgaaggtgg acgccgcgcc cagcaggccg tagtccggat 32160 
tgatcgagtc gatgccctgc tcgtagagga agttggagaa gagggtgatg ccggcgggct 32220 
cgcccaggat gagcgggatg gacagggtct cgatcgccgt gccgaagatc agcagacccg 32280 
cgtagagcat cggcgggcgc agcatcggca ccacgaccga gcgcaggacg cgcagaggcc 32340 
ccgcgccgac gctgcgggcc gcgttctcca gagaggtgtc ggaggcggcc agcgcgttgg 32400 
cgcagaacag gtaggcgatg gggacctggg cgacggcctc gacgaacgcc ataccgggca 324 60 
gtgagtacag gttccagggc acccagccga agccctcgcg caccgcgccg gtcaggaagc 32520 
cggccgggcc gtagacgacg atccacccga aggccaggac gagcggggag atgtagatgg 32580 
gccagcgcag cacctgcccg aacaggcggg cggcggggaa gcgggtgcgc tccagcagaa 32640 
tcgccatcgg caccgcgatg gcgagcgcga acacggtcgt caggacggcg aagaggaggg 32700 
tgtcgaggac gatcgaaccg aagcccgccg acgtgaacag gtgggtgtag ttcgagaggg 32760 
tgaaggcgcc gccggccgcg tacaggggct ggttgcggac cgactggtag aggatcggta 32820 
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cgacgggggc gaggacgagc acggcggtga cgaggaacgt cagccagtgg atggtgacct 32880 
cacgtccggc gccgaacagg cgccggtact ggggcgtgcc cagctcgccc gcgcgcggga 3294 0 
tgcgggacgg cgcgggtggc gccgggggtg tctggatggc catgacgact ccgtacgaac 33000 
ggggtgggga caggggcgtt gggcgggcgg gggcggctca gccggccgcc ttctcccagc 33060 
gcgcgacgta cgcctcccgc acgcgctccg gcacccgcac gggccggtac agatggacgc 33120 
ggtccgcgcc gagcctgcgc cgcatgtcct gcagactgtc catggcgtcc tggcgcacgt 33180 
ccggccggta cggcaccagg ccgccctcgg cgaccgccgc ctgcccttcg gcggagagca 33240 
ggaagtccag gaagagacgg gccgcgttcg ggtgcggggc ggtcttcacg acggacagcg 33300 
cgcgcggcat gacgacggtg ccctccgcgt agtagctcca ccccagcagt cccccgctgt 33360 
gctcggcggc gggtatcgcg acgccatcac gtacggcagg atcccgccgt gccgcaggta 33420 
ggccagttcc tggcgcgagt ggaggcgcag cctgagacgg accgtggcgc ggggtgcgtc 334 80 
gggccggacg agggacaggg cgacggggtt cgtgccgacg cacaggtcgg cgaggccg.tc 33540 
gaaggtgaat tcctcctctc cggtgaaggc gtgggccgat gcggtgtcgc cctcctcgaa 33600 
ctccaggggc agtacaccca tgccgatcag gttgttgcgg tggatgcgct cgaaggactc 33660 
ggctatcacc gcccgcactc ccagcagcgc ctgtgccttg gcggcccagt cgcggctgga 33720 
gccggcgccg tagttgcggc ccgcgaccac gacgagatcg tggcccgcgg cgcggtaggt 33780 
cgccgcggct tcgtggacgg gccccatccg cagttgcgtg ccccgatggc cgccctgcgc 33840 
atggacgatg cggccgggtc ccttcgaggg cctggccgcc gggtcgttcc tgacccgagg 33900 
cgccgatcag gacgacgccg tgcaacggac gcgaggacag cgtcagcttc gtccgcggca 33960 
acagcgacga ccccggtgac ttccgatgac gcgcacgccc ccgccgcccg aacccgagct 34020 
gaccgtcgac cgcgccgcct gctctgggtc accctcccgc tccgcctgcg agatcagatc 34080 
gccgacgcgc cgccgggcac cgtcgtccac gtcgtcgcca ccgacccccg cggcaccgct 34140 
cgacctgccc acctggtgcc acatgacagg tcacacctgt ctcggcacgc ccccggcgaa 34200 
cggccggtgt acgccccgaa gctcaccgcc gacgcgcgcg ccacccgccc ggacgcaccc 34260 
tggcacccgc tccggcggcg gcaggagcag ccccggaacc ggtgacgcat ctcgtcggcc 34320 
ggccgtttcg agtggaccgc ggacgcggaa cgtcacggcg tccggaaacc ccggaaggtg 34380 
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accggcctgc gtgtcttgaa gccgagccgt 
tcggccacgt gcaggaaggg acgatcaccg 
acgaggcggg cggcataacc ctgcccgcgc 
tcggtccagc ccggaggacg caggcgttcc 
cggacaccca ggtaggtgcc gagttcatgg 
cgcgcggcga gatccagcat ctcaggcacg 
tcggacgcgg agcgagttcg gccggggcgg 
tcaagactga aaaccggctc ccaacccggc 
gcgaaggcgc cgggaccgag taggccggcc 
acggacacgg aggagaaggt cgccacgtcg 
cgggcgagat gagcgtgccg accactgagc 
tcgtcgtcgt tcatcatcgt gccgtttcct 
cgcggtaggc gaaaagtcgg cggcggggcc 
gtgcagtttg ccgggtcgcc tgaggacttc 
tgccggcgga ggcacgcctt ctgacgtgcc 
gagcttgagc gatgcagcgc cgccggaagt 
cctcgttctg ggtgagagca gtcctgcctt 
ccggcgaagg ccaggacctg tcccacggag 
cagcttcagg cccgacgtag ggggaggggc 
gtcggtcaga cctgttcgaa gtggaacgcc 
gcgaagagga tgcactccac gagggactgc 
gtggtcgccc actcctcgga gagcgggtcg 
atcacccgga ctccttgggc gcgcaggcgc 
gcgctcttgg ccgcgtagaa ggcgtcgtgt 
caggcggaga ccatcgtcac gacgtcgggt 
ctcctcgtgg tcaggaccgt gccggtggct 
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tcgtacaagg cgatcgcgcc ggtgttcgcc 34 440 
cgcgccgaga tgcgctcggt gagagcgcgg 34500 
gcctcgggag cggcgcagac ggcgctgatc 34560 
ccggccatcg ccaccaggqt gccgtcgacc 34 620 
gtacggggcc agaacggccc cggctcggtc 34 680 
ctgtccgcgc ccagctcgac cacgtcggtg 34740 
ccgtcgccgg gccaggtcat ctgacggccc 34800 
ggcggaacgg ccggggagct gaacatgtcg 34860 
aggtcggccc agtcctccgc gtccgggtcg 34920 
gtgagatagg tggctgctcg accgaaccgt 34 980 
gactgaccta ccgggtcgtc gagtgcgggg 35040 
tcctggtgag cgcggtggtc gaagggtggc 35100 
cgtggcccga tagtcgtagc ccttgtcacc 35160 
cggctggagg ccaatgccaa agcgctctcg 35220 
tccaccggca ccactcagtt caggcagatt 35280 
cgagcgcctc aatgcatcga gcggcgactt 35340 
gtcgagtgat gcggcgttcg gaccgtcacc 35400 
tggctcatcc acctccccct cctcggccca 354 60 
gactcggaac ccggcgtccc gctcgcgaag 35520 
ttgatgaagc agtcccgggg ttgggcgacg 35580 
gcggtgaggg cgtcctcggc ttcgcgtgaa 35640 
gcgttgtcga agtcgggcgg gtagagcgag 35700 
ttggagagga tttcggtgaa ccctgcctgg 35760 
gcgtccgagc ggtggtggcc cggtgttccg 35820 
gtgtccgagt tgagcaggag ggggaggaaa 35880 
ccggaggcga tggtgtccac gacgtcggcg 35940 
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tcggttgccg acagcaggtc cggcccggtg 
tcgacgcggt cggtgtgttc cgcgacgccg 
gtcaggtcgc aggcgaaggc gtgcacccgc 
acccgttggg cggcggcgag ccggcgtgcc 
aggcggatgg ccagggttcg tccgaagtcc 
ttgtcccatc tcatggtgtc gttccccagt 
accgcgtctc tacgctatcg gtcatggtcg 
atgttgaggt gcccaacccc ggtgcggacg 
atcgggaagg tcgtgccttt gcccagaagc 
cctgggacat cgtgggcggt catgtcgagg 
gtgaagtcga ggaggagacc ggctggcgcc 
cgacctggac gggggacgac ggcggcggcc 
tggacggcga cctggaccac ccgaggctgg 
tcggccccgg cgatctcacc cgcctcaagg 
acgacctcat agccggtgcc gttgccgact 
tcaccagccc ggaccggctg cgcgagctct 
aggagaccga ccgcctgacc gaggagaccc 
tcatcggcag cgcggaccgc gagggccggg 
ggttcgtctc ggtgctggac gagcagaccc 
ggctcgacac cctgcacaac gtgctggaga 
ccggccgccc gaccacgctg cggatcaacg 
tgctcgcccg cctcactccc gtcggaaagc 
agcaggtgta tccgcactgc ccgaagtcac 
agtggatgcc cgccgacgcc cagccgagca 
tgcccggcct gaccctggac cggatcgagg 
acgaatgacg acgagtcgat gagcgccgat 



aggtagcggg agccgttgtt gacgagtacg 36000 
gaggcgaagt cgcggatcga ggcaggatcc 36060 
tggtgtccgc ggtcgcggat ctcgtcgcgg 36120 
gagaggaaga cctccgcgcc gaggtccgcg 36180 
cggccggcgg ccgtgatgac gacgcggtgg 36240 
cgccgtttcg tggatcgggt ggtgccgtgc 36300 
ctcacgaacg gtcgttcacg gtcaatgatg 36360 
aggtctggac cgtcggcgcg gtcatcctca 36420 
ggagccggga ccgtcgcctg ttccccgggg 36480 
agggcgagac gcttctggag gccctcgcgc 36540 
tgacccgtgt gcggcggttc ctcggcacca 36600 
tgcgtcacga ggccgactac ctggtcgagg 36660 
aatggtccaa gcactccgcc tacgactggt 36720 
agaaccgcgg accaggggag tacctgatcc 36780 
cgcctttcga cttgctccgg gcggacgccc 36840 
acccgcagcc gaacccgaac tcgctgcgca 36900 
gggcgctgat cggctgttcg tcactggtgt 36960 
cggacgtgac gccacgtggc ggcccggccg 37020 
tggtgatccc cgacgcgacc ggcaacaaac 37080 
ccggacgcct ggggctgctc ttcctcgtcc 37140 
gacgcgcctg tgtttcggcc cgcccggagc 37200 
cgccggtcac cgcgctggtg gtgcaggtcg 37260 
tgatgcgcgc cgacgcctgg cgacccgagc 37320 
gcgccgaggt gacccttgcg cagctgaacc 37380 
atgccgaacg ggagtcgctg cgcctgcggt 374 40 
gagccgatga gacccgacgg gatccgacgg 37500 
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gtcggcgtcc 


gcggcgagca 


gaccggtcgc 


gaaggtcacc 


gcccgcacgg 


cggcgaccct 


37560 


cgcgacggtc 


agtactgtcc 


ggtcaggtgc 


gggtccagcg 


ttggttgctg ccgttggagc 37 620 


aggtgtacag 


ctggatcagg 


gtgccgttgg 


ccgtgccgtt 


cccgacggcg 


tcgaggcaga 


37680 


ggccggactg 


gacgccgacg 


acggacccgt 


cggagttgag 


gcgccacttc 


tggttgtcgc 


37740 


cgccccagca 


gctgtagatc 


tggaccttgg 


agccgttgcc 


ggtgcctgcg 


gcgtccaggc 


37800 


acttgtcgcc 


gtagaccctg 


agctcgcccg 


cgtcagtggc 


ggcccactgc 


tggttggtgc 


37860 


cgctgtggca 


gtcccacagc 


tggagctggg 


tgccgtcgga 


ggtgctggcg 


tcgggcacgt 


37920 


cgaggcagcg 


gcccgaaccg 


acgcccttga 


tctgtccccc 


gtccgcgggg 


ggctccgagg 


37980 


agtcgccgcc 


gttgagtgcg 


tcgaggacgg 


cggtgtacgc 


ggccttcttg 


ctgccgtcgt 


38040 


tgttgaacag 


caacggcgtc 


tgctccgacc 


gccaggagtc 


gctgtcgcgc 


acaccccaga 


38100 


cggtgacgcc 


gaggcagcgc 


gagacggcca 


ggcagtcgtt 


ggtcacgttg 


gcgtaggtcg 


38160 


aggccggggc 


gccctggatg 


tccagctcgg 


tgatggccac 


gtcgacgccg agggcggcga 


38220 


agttctgcag 


tgtggtgcgg 


aagttgctgt 


tgtaggggct 


gccgctgttg 


aagtgcgact 


38280 


ggaagccgac 


gcagtcgatc 


ggcacgccgc 


gctgcttgaa 


gtcccgcacc atgttgtaca 


38340 


tggcctgggt 


cttggcccag 


gtccagttct 


cgacgttgta 


gtcgttgtag 


cagagcttgg 


38400 


cggacgggtc 


ggcggcgcgc 


gcggtgcgga 


aggcgacctc 


gatccagtcg 


ttgccgctgc 


38460 


gttgcaggtt 


ggagtcccgc 


cgcgctcccg 


aactgccgtc 


ggcgaaggcc 


tcgttcacga 


38520 


cgtcccactg 


gacgatcttg 


cccttgtagt 


gggccatcac 


gccgttgatg 


tggtcgatca 


38580 


tcgcctggcg 


cagcgcgctg 


ccgctgaggc 


tctgcatcca 


gccgggctgc 


tgggagtgcc 


38640 


aggccagggt 


gtggccgcgc 


acctgcttgc 


cgttctgcac 


cgcccagttg 


tagacgcggt 


38700 


cggcggagct 


gaagttgaac 


tggccccgct 


gcgg 






38734 



<210> 31 
<211> 3331 
<212> DNA 
<213> Unknown 



<220> 

<223> Description of Unknown Organism: Unknown 
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<400> 31 

tcggatctcc ccacacaaca tagatagagg atatccgcct gggttcacaa tgaagttact 60 
ggtggttctc accaccctcg tgggctttag ctcagcacta agtttcggtt gtaattacag 120 
accagtatta ggcttcaatt cacagtatat gctgggagga ctaagacttt tctgtatgcc 180 
tgccatggtt tatgatccat gggcatgtgg ttgcgtttcg gcatggagca gtgcaggtct 240 
ttacggtgtc ggagggggcg gaggcgcctg gggagctggc ggtgctggag gagccgacgg 300 
cggacgcggc ggcggcggtg gagattggga atatgactat gatgacgaca gcgatgacga 360 
tgatgaatgg gactgggatg atgacggtgg aatgggagct ggcgccggag gtggtgctgg 420 
tggtggtgcc ggaggtggtg ctggtgctgg tgctggagca ggcgcaggag caggagcagg 480 
tgctggactc ggacttggat tgggcggagg tctcggaggt ggacttggcg gacttggagg 540 
tcttggcgga cttggcggtg gagacgattt atttgattta gatttcgatg atcttggtgc 600 
agctcttgcc ctcggtggag ctggtggagc tggaggtgct gctgctgctg ctgcagctgc 660 
cgctgctgcc gccgggggtg gagttggtgg agctgctgcc gcagccgcag ccgctgctgc 720 
cgctgcagga ggaggcgcag gtagacttgg aggagctgct gctgcagccg cagccgctgc 780 , 
tgccgctgca ggaggcgcag gtggacttgg aggactcggt ggcggacttg gaggactcgg 840 
tggcggactt ggaggcctcg gaggtcttgg tggcctcgga ggatatggag gatctgctgc 900 
tgccgctgct gctgctgccg ccgctgctgc cggaggtgga ggactcggtg gtgttggttt 960 
ctacggtgga cgaggaggta gacgcggtcg aggaagagga ggccgcagac gtgctgctgc 1020 
tgccgctgct gcagctgccg ccgcagccgc tggtggtggc ggaggaggtg gaggtggtgg 1080 
aggaggaggc ggaggcgctg gtgctgccgc tgccgctgca gccgctgctg catctgcttc 1140 
agcttctaga caaatgagtg gtataaggga cgcattagga gacattaaag accttctcag 1200 
gagtaatgga gcctctgcaa aagcctctgc taaagcatca gcagtagcaa gcacaaaatc 1260 
tcaaattgac gatttgaagg atgtcttaaa ggatcttgca ggtctattga aaagctcagc 1320 
atctgcttca gcatctgcat ctgcatcagc ttcagctgga ggtggaggcg gtggtggtaa 1380 
cggaggtggt aacggaggag gaggcggcgg tggagctgga gctctagctg ctgctctcgc 14 40 
tgctgcagga gccggaggtg gacttggagg tggaggcgga ggcggagctt tagccgctgc 1500 
actagctgct gctggtgcag gtggaggagg ttttggtgga cttggaggac taggcggtct 1560 
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tggtggggga tctgccgcag ctgctgcagc cgctgccgct gctgcatcag gtggtggagg 1620 
aagagcactt agaagggctt tgagaagaca aatgcgtgga ggtggatccg ctgctgccgc 1680 
tgctgctgct gctgcagctg ctgctggagg tggatgggga ggtggaatgg gtggaggatt 1740 
cggagtaggt ctcggtggag gattcggagg aggatttggt ggtggatcat cagcagcagc 1800 
tgctgccgct gctgcagccg ccgctggatt tggtggaggt ggacgaagag gtagaggtag 1860 
aggacgtgga ggcgatggcg acggtaacgg agctagtgct gtagctgcag ccgccgccgc 1920 
tgctgctgct gctggaggat ctgctgctga tgttgccgct gccgctgctg cagccgcagc 1980 
tatgtacggt gacggtgctg atggacctga tttcgataat ggattcggtg gtggaaacgg 2040 
aaatggaggt ggcggatctg gtggtggcgg atccggcgga ggtggatccg gtggcggatc 2100 
tggaggtggc ggtggatctg gtggatcagg cggtggcggc ggatctggtg gttcaggcgg 2160 
tggcggatca ggcggcggtg gaaacaatgg atggggaaat aacggcaaca ataaatatga 2220 
cgatgatgac tgtgatgaat atggtaaccc tattagaagg gggtaaatta tttgacatta 2280 
tccgccattt gactcatttt tcttagttct ctatgtttta tacttcacct tagattgttt 2340 
tagtttgatt gaataaatta tgttttcgat ataaattttt tttaaattaa attaaacttt 2400 
attagttgac ctgtaaactt tttcatggag ttataatcta aggaacaaaa aacatacata 24 60 
atatgttcag tattgtggta aagcacctgt accgcaaaca caatcacctc tatacatgta 2520 
tacaaaatca gtaatgctga caaaatcttc tacactctca cctacacact cgcacacagt 2580 
cctcttacat acacagcact ataatatcct gaacatgaag tttgtgttga taaaaagttc 264 0 
agaaaaatct cccctacatc acctgatctt tcactgaaaa tttacgacaa gtattgaaaa 2700 
tagcagaaag aaaacgggaa attgagaagt tttctataaa aaacaatcgg aacaatgact 2760 
ggaatgacaa ggatgaaaat aatgataact tacattaatt aaggccccaa taatctctct 2820 
attttcaaac ttttttttca aatgttctct ctaactcact tgcatctatg tggaaattca 2880 
catactatac taaattacca caagtatcaa ggtttcacaa cctctcatgc cttcatggca 2940 
gaccatgctg ggtatttgtc taacaatgcc tcataaatac ataaaactaa ctaacaaaat 3000 
aggtcagtct gtaacaaatt attaatgcac cattattgca ttttctaaaa caaagcatac 3060 
actggatatt ggcagacaaa atgttgttat tggatacctt tccattctat ctagacactt 3120 
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gctttccaca agtcatcata aataaatccc ccctatccca aatgtcaatg gaatgcccca 3180 
acccttcccc cataatttta aaacctagaa taaattaaaa catctatagt tcgtcatgat 3240 
catctttctt atcatcctct tcttcttcct cctcctcctt cttcttcttc ctcctcctca 3300 

3331 

ggttcttggc tgcctgctcc ttccttgcca a 

<210> 32 
<211> 5224 
<212> DNA 
<213> Unknown 

<220> 

<223> Description of Unknown Organism: Unknown 

<400> 32 _ en 

ggatcccctg ctcgacgccg gcggcccggt acacctccat cgtgcggacg ttgttcccgc 60 

gcccgcgagg gtggacggag gtgccggcgt gacgctccac cagcatgtgc cgcaccccga 120 

gccggcccag gaacacggac gtcgacaggc ccacgagcga tccgccgacg acgaggaccg 180 

gaaccctgtg gaccgtgtcc ccggcccgat cggctcttgc gttcatcttt ctcctccagc 240 

gcgtgatgtc cgcccactcg gccggtttcg gccggggtca tgcatgcccc gcgaggctgg 300 

agcgcggtgc gccggggacc acacttcacc cgcttaaccc gctgcgttcg cgcaggggca 360 

cggcacgccc gacgatcgtg ctcacgggcc gacgcaccgt catgtgacgc gtcggccgcc 420 

ttaccgttcc tccaggaaga ggtgcgcctc aatgacggtc tctgccgctg tgtccacggt 480 

cccggaccgt gtccccctca ccgtgttcga cggttcccgg gtgcgggtcg tgctgatgct 540 

ggacatccgc gacgggacgc aagcggaggt cctggacgcc tacgagcgga tgtccgaccg 600 

ggtcgccgcc gtgccggggc acatcagcga ccagctgtgc cagtcgctgg agaaccccac 660 

ccagtggctc atcaccagcg agtgggagag cgcaccggag ttcctcgcct gggccaacag 720 

cgaggaacac ctggagatgg tccgtcccct ggagccctac gtccgcggca cccactcgat 780 

gcgctactcg gtgctgcgcg agacggccga ggagcgggcc ggggcgggtg cggcggcccg 840 

gggcgcgctg cagccccggc cgcgcatcgg cgacaacgtg gtccggcacg ccgtcaccta 900 

caccgtcaag cccgacagcg tcaccgaggt cgtgaagatc ctctccgcct acacctcgcc 960 

cgaggtgcgc gtggacgaca ccacgcggct cgtgcgcacc tccctcttcc tgtacggcaa 1020 
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ccgggtcgtc cgggcgatcg aggtgcgggg cgacctgcag gccgccctgc gccacgtggc 1080 
ccggcagccg gaggtgcgcg ccgtcgagga agccctcacc ccgcacatcg aacaggaccg 1140 
ggacctcacc gacccgcggt ccgcccggct gttcttcacc cgggccgcgc tgccggccgt 1200 
ccaccacgtg gtgtccgggc gcgggacggg cggcgacacg cagcggtgcg cgctgtacta 1260 
cccggcccac cccggcgccg gaccggcgct cgcccggctg ctggcgcggc agggcgaggc 1320 
caccgtgggc gacccgggca gtccggtcgt cgcctgcacc gtcttccacc gcgacgacct 1380 
cgtcgtacgg ctcgtcgaca cggcgggcgc accggagcgc gcgcccgggg ccgtcctggc 1440 
cctgcacgag ccggacgccc tcgccgaggc cgggcggctg ctggacgccg ccgcgctcgg 1500 
cgccgacggc cccccggacg accgggcgct gccgacgttc ctcgcgcacg cccggatgcg 1560 
gcctctgaca gaccgtcagt cgccggcctc ctgacccccc gctcgcccga cctcagggag 1620 
tgaccgacat gacagaacag caggcacgca tcgtcgcctt cgacgacgtc ccgcccaacc 1680 
ggcggcgcgg cggcgacgtc cgggccctgc tcacgcccac gaccgcgggg gcgaccagcg 1740 
gcttcatggg cgtggccgtc gtacggcccg gagaacgcat ctccgagcac taccacccgt 1800 
actccgagga gttcgtgtac gtcaccgccg gcgccttcga ggtggacctg gacgacgtgc 1860 
cgcatcccct gcgcaccggg cagggcctgc tcatccccaa ggacgtgcgc caccgcttcc 1920 
gcaacaccgg cgacgtcgag gcgcgcctcg tcttccacct gggtccgctg gccccccggc 1980 
cggacctcgg gcacgtcgac accgaggaga ccgacgagac cgcgccggcc ggggtggtgt 2040 
catgagccgc cgggtcgtcg tcaccggcat aggcgtcgtc gccccgggcg gcatcggcgc 2100 
ggcccggttc tgggacctgc tggccggcgg gcgtacggcg acgcgccgga tctccctgtt 2160 
cgacccggcg cgcctgcgct cgcagatcgc cgccgagtgc gacttcgacc cgtccgcgca 2220 
cggcctggac gacgagacgg tccggcggtg cgaccggtac gtgcagttcg cgctggtcgc 2280 
caccgccgag gcggtccgcg acgcgggcct ggacaccacg cgcgaggacc cctggcgcat 2340 
gggggccgtc ctcggcacgg cggtcggcgg caccacccgc ctggagcacg actacgtcct 2400 
ggtcagcgag ggcggctcgc gctgggacgt ggaccaccgg cgggccgagc cgcacctgca 2460 
ccgcgccttc gcccccagca cgctcgcctc caccgtcgcc gagaccttcg gcgcgcaggg 2520 
cccggtgcag accgtctcca ccggctgcac gtccgggctg gacgcggtgg ggtacgccta 2580 
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ccacgccatc gccgagggcc gtgccgacgt gtgcctggcg ggcgcctcgg actcgccgat 2640 
atcgccgatc accatggcgt gcttcgacgc catcaaggcg acctcgccca gcaacgacga 2700 
cccggagcac gcctcccgcc ccttcgacgc ccgccgcaac gggttcgtga tgggcgaggg 2760 
cggcgcggtg ctcgtgctgg aggagctgga gcacgcccgg gcccgcggcg cggacgtcta 2820 
ctgcgagctc gccggctacg ccaccttcgg caacgcccac cacatgaccg ggctcacccg 2880 
ggagggcctg gagatggcgc gggccatcga caccgcgctg gacatggccc gcctggacgg 294 0 
cacggacatc gactacgtca acgcgcacgg ctccggcacc cagcagaacg accggcacga 3000 
gaccgcggcg gtcaagcggt cgctgggcga gcacgcgtac cggaccccga tgagctcgat 3060 
caagtcgatg gtgggccact cgctcggcgc gatcggctcg atcgaggtcg tcgcctgcgt 3120 
cctcgccctg gcgcaccagg tggtgccgcc cacggccaac tacgagacac cggaccccga 3180 
gtgcgacctg gactacgtgc cgcgcgaggc acgcgagcgg gagctgcgca gcgtgctgtc 3240 
ggtgggcagc ggcttcggcg gcttccagtc cgcggtcgtg ctgaccggac cggagaggag 3300 
gctgagatga gcgcaccccg gcgagccgtc gtcaccggac tcggagtggt ggcaccccac 3360 
ggcatcggtg ccgagacgtt ctggaagacg gccgtggacg gcaccagcag cctggcccgg 3420 
atcgaccggg agggctgcgg ccacctgccc ctgaagatcg ccggccaggt ccccgacttc 3480 
gacccggccg ccctgatcga ggacacctac ctcgtccaga ccgaccgctt cacccacttc 3540 
gcgatggcgg ccacccagct cgccctcgac gacgcccggc tctcccgcgc cgacatcgac 3600 
tcgccgtact cggtgggcgt ggtgacggcc gcgggctccg gcggcggcga gttcggccag 3660 
cgcgagctgc agaaactgtg gggccagggc tcgaagtacg tcggccccta ccagtcgatc 3720 
gcctggttct acgcggcgag caccggccag atctccatcc gcggcggctt caagggcccc 3780 
tgcggcgtgg tggccgccga cgaggccggc ggcctggacg ccctcgcgca cgccgcgctg 3840 
gcggtacggc gcggcaccgc caccgtcgtc gccggcgcga ccgaggcccc gctggccccg 3900 
tactcgatgg tctgccagct gggttacccg gagctcagcc gcagcgccga cccgggccgg 3960 
gcctaccgtc ccttcacctc cgccgcctgc gggttcgtgc ccgccgaggg cggggcgatg 4020 
ttcgtcctgg aggaggaggg cgcggcacgc gagcgcggcg ccgacgcgcg ggcgacggtg 4080 
gccggccacg cggccacgtt caccggcgcc tcccgctggg aggagtccag ggccggcctg 4140 
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gcgcacgcga tcggcacggc gctggcgcgg gccggctgcc gtccgcagga cgtggacgtc 4200 
gtgttcgccg acgccctcgg cgtgccggag gccgaccggg ccgaggccct ggccctggcc 4260 
gacgcgctcg gcccgcacgc gcggcgggtc cccgtcaccg ccccgaaggc gggcatcggc 4320 
cgggcgttct gcgcggccgc ggtgctcgac gtggcgaccg cgctgctcgc catggagcac 4380 
gagctgatcc cgcccacccc ccatgtgctc gacgtctgcc acgacctgga cctggtggtc 4440 
ggccgggcgc gtcccgcccg gccgcgcacc gcgctggtgc tcagccgcgg actcatgggc 4500 
aacaactcgg cgctcgtcct gcgcaggggc gccgcgccgt tccccgagta agtaccccga 4560 
acaggtgtct cacgtcccct tcgggcgcgg gcacccgagt caaggagctc aaccacatga 4620 
ccgacatgac cgaacgcgtg ggcacccagg tgaccttcga ggaactgtcc gccctgatga 4680 
agcgcaccgc gggcgtgcac gtggaaccgc ctgacctgcg ggcgcgggcc gaggagggct 4740 
tcgacggctt cggcctggac tccctgggcc tgctgggcat cgtggccgag ctggagaaga 4800 
agcacggcgt gggactgccg gagcaggtgg agcgctgcaa gacgcccgcg gagttcctcg 4860 
cgcaggtgaa cgccaccctc aggacggcgg tgtgacatgg ccgggcacac cgagaacgag 4920 
atcgtcatcg ccgcgccgct ggacctggtc tgggacatga ccaacgacgt cgagaactgg 4980 
ccgcggctgt tcagcgagta cgcctccgcc gagatcctgg agcgcgaggg cgaccgcgtc 5040 
cgcttccggc tcaccatgca cccggacgac gagggccggg tgtggagctg ggtctccgaa 5100 
cgcgtcgccg accgcgcctc cctgacggtc cgcgcccacc gcgtggagac cggccccttc 5160 
cagttcatgg acatccagtg ggtgtacgag cagacgcccg agggcgtgct gatgcgctgg 5220 

5224 

atcc 



<210> 33 
<211> 30601 
<212> DNA 
<213> Unknown 

<220> 

<223> Description of Unknown Organism: Unknown 

<400> 33 cr , 

gatcttagac cttattcact tgatacgtgt aatagttatt acgatagtat gtttttggcc 60 

gattcctccg cgtcttcttt cgacgacgtg gaggtggagg caaaagcgaa gtagttgtgg 120 
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aagaataaga attatgatta tcatgattat tattcaaatt aactctattg ttacgtaccg 180 
cgctccatgc agacgtttgc caggagacga cgggtggaag gataggaagc gaagaagcgg 240 
aagcggaaga cgtcgtattt gaattcgaag atgataatga tgtcattgat gctgatgatg 300 
ttttgttgtg ataatgagat cggcatggag gcatttcaca atctctttgt tcgcgaggct 360 
taatctctga gcactcgata tcgtcttgtt tacgaccgga aagcacgtct tcacaccaga 420 
ttttgcggcg ttgaactccc ccgccacacg atacagaaca ctggaattat tattagaagc 480 
ttcaatgatg ttctaagaac ttacgtgagt ccatggagat atctgccaag aattatttgt 540 
gagtggtgga catagttctt cattgcattt atcaaacatc tttggttttc tggtttcatc 600 
gcaagaagac gaagtgcaag atacactgcg acgacgccat cccccaccac aagaagcaga 660 
gcactgaaca atcgtacatt agtaaattct aaatctgaaa attatatatc ccacttttga 720 
ccaatctcca ataatccagg atccaatatg ttcttctccc tttggacagg gttcaagtcg 780 
gcaatttctt gcacttgttg gtctcttttg cacatcacaa tcaacatctt tcaaaatcgt 840 
ccgaccaccg tcttccgcac tgacgcatgt aacatttcta ctttgttgaa catgagttcc 900 
acaagtagct ggacactctt cccattccgc cattttccag tatgaacaat cacgaaggcg 960 
acaatttctt gttgatactt ccttatccaa atgattgcaa tactcatcag gaagatcacg 1020 
aacatgatct cggcacttga gaagacgacg ttgagtacca ttaccacaag ttgctgaaca 1080 
ggctgtccat ggtccggttg cccatcggat tggtggtacg tcggcttgaa gtttttgaag 1140 
tactctgggc ccatcacaag tatctttttc acaagtcttt tttaggcgtg gacgagtatt 1200 
ctgaaatgat attttcgttc aagattaaaa gcaaactgaa acgtactcga tcacaaaaat 1260 
attcatcaac aatagttcct tcagatccac gagtacacga aacacttcta gtctgaattc 1320 
ctgatccaca agtgactgag caaggggacc aatgacttgg tttccaagat gtacatggca 1380 
gaagatggca agtttgacta gtttctggca ttttggtatc tccacaaaaa gaagcatcaa 1440 
cagattgttc gcggtatatg cattcggttg tacgttcccg atgaccgatt ccacaagata 1500 
cagaacactg aaacgtattt atggtattga caacagcaat tctggagtat ttgaataaac 1560 
ttacagcact ccaatcagta tttctccaaa atgggcaagt gcctaaatta catgtctgat 1620 
gagaagcagg ccgatcagat gcagtaccac aaagtgacat atcgacttca gttccatttc 1680 
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cagaaacaca tgaaactctt cttgacgacc atccatcctc acaagaaaca ctacactgag 1740 
accattctcc aagtttatat tttggacatg attctctatg acatggtttt gtaattatct 1800 
tttccatttt aaggcaacga tgttccggta gtactgatct atgacgatcg gtacaattag 1860 
cgtctcgata ctgtacacca tctccacact tagctgagca gtcagaccag actccgaact 1920 
gccaccaagt acaagcatgt tcattacaat gttcttgtgt ctgtgctgga ccacatctgg 1980 
atgtatgtgt ttcccgatcg gctgcatcca aacattgagc atgacgcatt ttgactccac 2040 
catcacaact tcgagagcac tctgaccaat gcccataaac ccatcttgga catggaattc 2100 
tgttacattc ccgttctgtc gcctctttct gttctctgcc gcacaatgac tcatcaactc 2160 
gacgattcga atcatcaacg caatatgact tccgatgcat ttttccattc gatccgcaag 2220 
tttcagaaca tgaagtccat tctccatagt tccattttct tccagagcag tcaatgtaac 2280 
aactggcaat atcggatggt tttgaattac gatcacatag atgttcggat gctggagttt 2340 
gacgatcacc ctccattttt acgcaagaaa ctcgttgacg tttctgtcca gatccacatt 2400 
tggcactaca actagacaca tcttcagtga tccatctgta aatataaaaa tttattatag 2460 
aaatctaatg aaaatatgta gtttaccttg tagaacaatc tatattgcac attcgtgttg 2520 
cttgttttgg tttgagaaca ttttgacaat ttctatcatg actttgacga tgagtcgaca 2580 
tgtccagaca cattaatttt tgcgattgct gtccacgaca ggctctatca cattctgtcc 2640 
aagtatccgt aactctccac aaatacaatg cactggatat tggccgaatt acagcatttg 2700 
gaacagccgc agtcatgtac tcatatgaga tgtcgggtgg atgactacca acagaaagaa 2760 
catgaacata aatgtcactt ctaatcggac cagttccatt tatccgttca ataattgcat 2820 
cagaaccaga atattcgaga acagtgtctt ggaatgcaat ttgttggcga gccagtgata 2880 
cttggaaatg accgttaagt aggaattcac cattggcggc acggagagct gaaggttaaa 2940 
ataaagattt tcatgggtat tgataacaca agggtgagtg atgaaaaaag taaatgttcc 3000 
aaaaacactt tgtatagaaa ctcacaaaga taattgtcat cttctttcat attattatat 3060 
cctttctgcc ggatatcaat atttgcagaa ccagctggaa tcttcattac ttcgttataa 3120 
ccaaaggttc cttgctcatt aaatgttcct ttgacaacct tacaggaaga atcatcccca 3180 
ccgcaaacac cacatttgtc tcttcggaga gttgaatgaa gttgatgatc acagcctgaa 3240 
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aatccactta ttttcaattt tcttttgaaa tcagatcaat gttacctgct ggcatacaag 3300 
ctccagctac acaaatatcg tctccatttc tatcacatgg tgttccatca acaactttat 3360 
ctcgaagcag atagaacgct gcagatccac tgagccgaca atacagcttg caacgttcat 3420 
ttggtgcaac attcgcatat tttggaaccc agtgagtatt cgttgaagcg acaccttgga 3480 
ttccaatatc tttattgttg aattcagaac attgaacttc acggtatggt tgagtatccc 3540 
atgggcattc ttgtgtatta catgaccgat aacgttctcg ttgaccaaca cagtactttc 3600 
caccatttcg aggtctaaag taatatggga aaatgtcatt ttaatattga taggaaagct 3660 
tagccagtgt ggcctaaaag ctggaagttt ttttaaagat gcgttttcta tcaatttaag 3720 
ataaccggct acttcaggtg attctataaa ttttataaag cttggaagct aggtaaatct 3780 
gaaaagcctt aaactatctc gaagcggccc gaaagcccag aaaagcagag acggacaaac 3840 
atttaagagt gatcagaagc actccatacc ttgatgttac atttgatttt agtgtttcca 3900 
cctcgttttc acttctgaac tcgccgattg aaaatatttt gattgaatat attatttgct 3960 
ttcagactat ttgatatcat ttcgtttggc agtttaactc actttgggct gtcacaatct 4020 
cttaatcctt tttgaacacc accaccacaa gtacgactgc attctcccca tgatcgccag 4080 
tcaccccatt gtccgtcaat tttggtaagg gattcggggg ctagacgaac acaggctcca 4140 
tgatgacaga actaaatatc caagttttta tgagtttctt ttgtgattaa tttctgagat 4200 
actcaccatg cttcttgatt cgtcacaagg agttccgtcg gcccatggca tatgctgagt 4260 
tcgacagccc atctggcttc cgtagaatgt tgcacaccaa agacggcggc atgtcggctg 4320 
taaaatatca atgtttcatc ttaaagaata tatttaggca aactaaccat ataagggcac 4380 
aactcagaag ctggtccaaa tacaaacttg cactgttgat gagcatcgta tttctttcct 4440 
ggttcatcac gtacaaagac atcctcgtag taacgacgtt cgaccggctg atcgaataga 4 500 
cattgagttt gacctcgatt atttctgaca aattgacaat taaatagaat caaaatttta 4 560 
atagctatct tactcgagga atcgttcgag cattccagct gaacatggcg accaactcca 4620 
tggatgagtg ttatattcca acgttggtgc cattatgtgg aagttgttct gaaactgcgt 4680 
tttatcaaat ttagtgcttt ggaacttgca aaccttatta accggcatgt aggtagagca 4740 
ttttcgttcg tcatcatgag gaatcgaaaa cacatgaccc aattcatgag caattgtgaa 4800 
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tgcagcactc aatccattgt cttctatgat tgcacaactt ttttgcatat cacacattgt 4860 
tccaagttca gcaagtccaa gtgtatcgca ttttccttgt gatcgacaaa tatctttacg 4 920 
cgtcaaaagg attgcaacgt catgatgttg gacactcgaa tcatctggat cattgtaata 4 980 
ctgctgccat ctacagaaat cttgaagtgt ttgttgagcg ttctgagtga ttcgtggtcc 5040 
agcgttttcc gttttcaaaa cgatcaactt gacaacaacg acattgatag atgcacgaag 5100 
ggattggtga cgatagatgg aggcaactgt ggagaagaga gtgagaacgt agtcttcaag 5160 
agatcttccg tgatattcgt acatttttgt atccgccacc acaaggactt caacatagtg 5220 
atcccaagag ttggcagctc ttcgggatct tgctttgcgt tctataatta aatccttttg 5280 
tttcataaaa ttatttaaac atttttttac tgtatccttc ctattaatct tgcaccccag 5340 
agctccactt tgacctatct ttgttgtcat tgactctatc aaaaactgtt caactatgaa 5400 
aatggggatg caagactaat aaaaggtatt ggtaactggt tccagtagag ctttttttac 54 60 
tatctgtttc attgattcaa ttttcagatg tttatataac catcttaacc gttcaaatct 5520 
cataacatag aacagcctgg cagcccgtga aaaggtgctg aaatcccagt aatttcaatg 5580 
gcattcgacc acacacaagt gatccattat cctttgctct tttacttcgt taactaccat 5640 
tagctatagg ggacccacga gcaaaattct atagtttctg tgtgtgttag ggtgttttaa 5700 
tgggctatta cacaacaccc gatgggatca gcagaatctg agatcttttg ggaaccggaa 5760 
aaaaatattg tgataacttc tcttttttct acatttttta cagaactagc aggtaaactt 5820 
tcagattgaa atctcgaaaa atgdatccgc ctactcaaaa agtcgttttt aaaatgattg 5880 
tttctttgtg tttgtcctct ttttcccgga cgtacgcaac acaaaaccgc ttgcgcgagg 5940 
atgtacacaa aacgtacgtt ctgcgcaatc ttttccctgc agctctctct ctctcacttt 6000 
ttctactcca taaatcagtt ctctgtctgt ctcccaccac ctaaatcatc atcagcatca 6060 
tcacagtccc cccaccaagt tcttgtgtct tctctgacct ttacacgtcg actagggaaa 6120 
agctctcaag cagacactcg agcgccagtt gaaaaaaata gtgtgtccaa atgagcagtt 6180 
tcgaatttga accgtttgtt cttgttctga cataaaccca aaaaaacgaa ctaggcggca 6240 
aaagagatct ggataatcta aagaatctag acaaatttca gaagttctta ccaataacat 6300 
cttcccactg atcttgccac gtggcaaccg tcgtctccgt ctcgttgaca ctggtcgagt 6360 
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taagatggtc aaacgatttg aagtgcattg gatcgaactt tcggacgaga tgttgcctat 6420 
ggcgacttgc tccgtcgtgc tctagatgtt taaagtgtca gagaaagtga ttacaaagtt 6480 
tctacctgtt ccgtttccac taataattgg ctcaaccgta tggattccgc tgggtagtgc 6540 
aagcattccg tactgaaaaa ggcttttatt caccaaaatt cgaacttata caaaccaatc 6600 
cgtcttccga gtcgcataaa ttgacgatgc tgtgctgatg tacaccttta acgtgtgcac 6660 
ggtagataca atcgggatct gttcgagaca ttccacctct aacctcctcc tccgagtcca 6720 
aatataagac catcggcgcg aaatttgaat tggaaaagtg gggaacactt ctggaattga 6780 
aaattaatac gactgttata ataaaattga aatctcatac ttgttatgtg agtccggtat 684 0 
ttgattccat ctgtgcaaat gaacgatgta gacggcatca tctgatcgta atcgtaagtg 6900 
acaagcatgt ccacagtctc tggcaactcc ttggagtcga cgtcgccgat ctgttgacgt 6960 
gacatcacgt tttccacgac gtccataaga atctcttcgg acgatgtgat ggctgtcgat 7020 
gacgtgtata ccggcgtctt gacgccatcg actgtgatgc actggcacac ctgaaactta 7080 
gaacattatt tcacttcaaa actttttgga ttgttacctg agtacttggc cctggagaac 7140 
agcacatctg atgagaattc tgagatcgtg ccactccctg ctgaaaggaa catttggtta 7200 
aaaacaaaag ctgataaatt aaaataatta gataaaaacg aacattgcaa cgcataaacg 7260 
aggcagacga cgaggagtat gagagcggcg acgacgggct gcagcagatg gaatgagccg 7320 
ccgatggagc gcataccaac agctccgtga tgatgattat gattgtgtgg agagcagcaa 7380 
agaaaaaaga gatggaaaga agcagaagct ccgataaagt tcgtccgtct cttctgaaac 7440 
cttccaaaaa ctacctgctc gaggtgaagg gaagtcgtct gattgaactg ctactgcttc 7500 
tgatcttttg ataatctccc gagtttgtgt tttcgtttag tcgaattaaa attgtagatt 7560 
gtggaatgag cacttgcaat agggaacaga gcatcacaga ctgaaaaatt aaaaattatc 7620 
tagaatgcaa gcaattttta aatttgtttt aaaatcactt attctgacgc catcttcttt 7680 
tccgatttgc gcagaataaa taaaaacttg actgtaatat tgggaaaatt tcgaaaaaaa 7740 
acaccgttaa gtctgagccc acctttcgcc tttttttgtt gacgaaaaaa accaaacaag 7800 
ctttaaattc ataaaattcc caattttaaa aacatctaaa gtcaattcct cccaataatg 7860 
catttgtata tgaacaaaag tctgttgacc ataagtcgtt atattactac aagcaattgg 7920 
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tcatcaacaa acctcataaa aatcagtttt gaacgggagc aatttatata aactctgtgt 7980 
gctcttttgc tctttttctt atttcttagt tgtcttctag ttccgccacc actttcgctg 8040 
ctcttgacga aatctgtaaa ttgttcgtca tttttgattt ataagatttg tttggctctc 8100 
ggtaggagct ctcaagctgc taatagtcct atagtaaagt actaaaaaca caaagaagca 8160 
gatgaaggtg tcataaaaca ctgataagaa tcatcatgat taggttggtg cagagaaaag 8220 
aagaagaaga aaaaggagat ttagagaaga gaaacaagaa taaaaatgca aaaataaaaa 8280 
aaatagtaat aacaatgaac gcagagtctt ccatgttgga gaaggaacag gacccatgtt 8340 
gatgtgtatc tgaggggatc caatgtgtag tgatggtagt aaacacttga gagggaactt 8400 
ccacccccga ctagatgatt ggaagcaatt gatgatagat gtagagccaa agaattggga 8460 
cctactaatg atctagtcaa gattcttctg ataagagaaa aagacaagga agaacatgaa 8520 
aatgactggt gattgaaaaa taaaacggtt tatgaagtcg gggtgtacta aagatgcaag 8580 
gtctcttgtg acgtattttt tcttccaggc acgttcgcgt tattcacgat tttatgcaaa 8640 
caaggtaagg agtgttttga attttgaata taaaaattta aaagaaatta aagttagaca 8700 
tttgaaaaat tagacaccct catgggaaaa attatagggc gaggagaggc ggtgagaggc 8760 
gccctaattt ctgctcggtc gggtagaatg tctaatctaa atcctacctc atgtttggct 8820 
ccttcttaaa tcaaaagctt aaggtcatct ctgaaacgtg cagttgacaa gttcaatggt 8880 
aagaacaggg agcaagcatt tacaacaaaa aagtaaacaa aaattgcatt tgtcgcagtt 8940 
caaaatggaa caactcactc ccactcgaga acgttttgaa ggggagagga agaagaggaa 9000 
aatcatcaca caggcacatg gaacttctgg gacacaaaac aatacaaact gggtgccgtg 9060 
aatctcagta cacacacaca aaaatcaaaa aagacggaaa ttaggagcag atgtggtaaa 9120 
gggtggttca atgctgatgg gagagagagg gagaaacttc aaaaaaagaa gtttagattt 9180 
atgttggcta tttcaatcct aaatttatct aaacaattct aaaaatgctg gttttggaag 9240 
gttatctggt aatggtgaag ttttataaac aaaacaagac aaacaattct tgagatctta 9300 
aaaatcttag cgactacaac aatatttagg tattttttaa tggaaaaaag tattgattgt 9360 
tgacttggga aattgaacag caattttttg tacttttaaa tcagttatat tttaactttt 9420 
tagagcacat ttcgtagaca aaagggaaaa cgattggtcc aacatgtgaa gatgatgatg 9480 
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tcaacaagtt ttggatcgga gccaaaaaag aaacaaaaca ttcataccat gatgggaaac 9540 
aagaggtgca gcaacaactt ttatcaatat tttgtttatg ttttgattat ttttctggca 9600 
cccagccagt aattcttttc cgtagagttg acctagaaaa tgttggaggc ggagtcttag 9660 
gatcaagaga cgcagactat caaagtaaaa tgagtaaaag gaagtgatat aaacttagga 9720 
aacggaggaa aaaaggacga tgataagaga ttgaagactt ggaagagtgt gctctttgcg 9780 
ggagagcata ttcttttgag aaaaatggga cctaggggca actgacgcaa ttgaaacatg 9840 
gtcgagcggt cggcgggaag acaaaaagtg aagaaggatg ggcaagaaga agcaagagaa 9900 
atggcaccca ccgtggaaca tgatcatgat gattgagagt gaaaattgga aatctcgaac 9960 
ttttttgcaa cggcgcgttt tggaaaacta acaaagttga ccaaaaaatt attttacatg 10020 
tataccggga tgtctaagaa ttgtaaaatt gagtgatcct ttctgtgaca taatttaaag 10080 
caatttattt tggttatttc taagcgcctt tttatactag catgttatat tgttaatttt 10140 
attatctaaa ctgccgttct tcctatattt attattgcac cccctttgtt cattctgaca 10200 
gactatacct cgattaatca taaaaatgtc acaaaagaat aaaaacaact aaaattaaga 10260 
aaatacaaga aatttatcaa ttgccaaaaa ttcggccaat cggaaaaatg cttggttgcc 10320 
aatttgtcaa aaatttagtc aattggaatt tgtcgatttt ccgaaatgat atgaaagttt 10380 
gaatgatgca gctaattttg cagtttaagt ttacattttc aagtttactg taatttttcc 1044 0 
aaaatatgaa gaagagtttt acgaaattaa aagataataa aaaagcaatg caaacatagc 10500 
tatgaaatct gatcccgact aagtttgatg gacataggat taataatatt agtctaactt 10560 
tctatagaac actaaataaa tacattcact ctcgaaactc tcccttttct gccatcaact 10620 
accgtactca cttttgactc aatgacccgc aactgtcaag atgagttagt ttcaagattc 10680 
tctgaaacag caataatcta acaagagaaa ctgaaaaaat agagtaaaac taataataat 10740 
accacataaa ttgacatgca tgatagatga ttttccggtt ttcaacaaga aaaacaacaa 10800 
tttccgagaa atcctcatag tttttggtaa gaaaaaataa attgatagtg atacggtatg 108 60 
actattactt ctaaagactt acctgattag aaacgtgtag taattgaaga agaaaagttg 10920 
aatttgagaa gttgaatcga gtttacgatg tctgaaaaaa acatagatat tatggtaaga 10980 
tcaagcatag aaaaaatgga aaaatacaag aaaatagaga ctagagattg cataggtttt 11040 
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gcggtggcga aaccgcacac atttttgtct 
tctctattta ctgtccagaa gaatgaagaa 
agaccgagtg atgagagccg cagttttgca 
gtactacgag aagcacacac acatacgaaa 
cgacatattt aattttcgaa ctaaagttta 
caaacaactt tcagttttgc ggaagaaaat 
caacactatt gcgtatcaaa aatttttcga 
ttttggtttg caaacattta aaagaactca 
aattgtagga tacaaatagc tacagaacaa 
tttaactaat acaaaaacac agaaactttg 
aagcttgtaa aatttaaata tggatacagt 
catttattta tatatcttgg caaatcaaat 
tctacaatta aatcagttgt tttttcatcc 
acaagggcaa agagtacatt aggaagagtt 
gacctctttg gattgaaaaa gcgaacgaat 
ttcgtggcct cgagaatagc aaattggcgg 
aacaaccggg acgaatggat tactgaatca 
tgtgaatcgg atgaaatgtg atgattttag 
gacgcggaag aacaggaatg atcaaggggt 
tgctccaaat gtgagagata atcaggtagg 
aacgatagtg cgcaggtgct tgatttctgt 
gaacttttca aggggtttcg gaaagggttc 
ttggaagaac tttcaggata acatcgctca 
tatataccga ttattgaaac attttaatcg 
tttgaaaaaa aaatcacaat tttttcaatt 
ataacttgca acccgggtat ttcattctaa 
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gtgttatctc taattttacg ctctcggtgt 11100 
tatgggggaa aagtgcgcgg gaaaattgag 11160 
aaactttttc gggcaataat ccgccggcga 11220 
actgttgagt taaaacctaa aaaattgttt 11280 
gagggtctgt gcgtgcattt ttgaattttc 11340 
tacagcgatt ttttcgaata tttctgaaaa 11400 
tttgccaaaa ttcagactaa gttttggtgg 114 60 
aaaaacattt ttagatgttc gaaaccgtac 11520 
ttagaatata aaatagagtt gtcaaacatg 11580 
aaactcgaaa tttttatatc aaaattgaaa 11640 
acaaacaata taatcataga tcaaatagtt 11700 
cgtatccctt acccactcat attcgatgag 11760 
tcccggacta ttagtttaac ttccacttga 11820 
tatgatgaca ggaaaaaagc tatgtaaaat 11880 
tgaggtttag gacccccgga aaatgaagaa 11940 
aattaattat ccgtaagagt gtgaattgga 12000 
aaaatgaaag aaagaagaga tgaaaatacg 12060 
aataacctaa atgcaacaaa acgacgtaaa 12120 
acatcttata ggggaaaaat gcactttttg 12180 
aagagacgta gaataggaac aggaaacggt 12240 
gcttttgcat gtgttccgat ggaatttttg 12300 
gagatttcgc atgtgagctt tggaagaatt 12360 
agcttgtttg ttagatttca gacttcaaag 12420 
tttcttacta ttagtaaagt ttaatcacag 12480 
atttagacca aactaattat ggtacagaaa 1254 0 
tttttttcat ttggaaccac tagtttttga 12600 
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aatagaaact cgttaggatt cttcacatat 
cagatctaag ttcagtctaa ttagaatcgc 
tacttaggca cagggcatcc ttctaacttt 
acgtgaggaa tcaaggagaa aaaggaaaag 
gatttctgtg ctcattcctt gttcggatga 
gcgggaatcg ccattgccga actttttcaa 
acgttccgat ttttaaaatg gaaaaaaatg 
gcgtgacatt tgttaatatg gaaaaagcat 
cagcacacat tttgaagact tttttacaaa 
gatagtatcc ttaatgctta atcatatcca 
ttctacatca aaaaatcata cttagtgatt 
atatttttga tttttggaag gatgataact 
gtttaaaacc tctgaccttc tacagaaaaa 
gttcagaact tgtctcgggt caagccatga 
agagaggttt atgactatta ttgtagttga 
acagatggtc ggatttcatt cataaaatat 
ggtcaactgg ttttaggttt cttggaattg 
caaatatcta ctaaattgaa gtttgttatt 
aatcactcag agcaaaaatg ttgggaaaac 
tcttactatt agtagatcaa aaaaccagac 
tattcaaaaa tgagcaaacc aagaaattgc 
cgaaacattc gctctacctt ctttttttct 
atttcagtca aggttttgga aaattgttcg 
agtcaaaaca cacaatagga atataaaatt 
agcaaaatca aattctggaa ctgcgacttt 
gtctttcaaa atttaaattc cagaagagtt 
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tatcataact atcagtattt tgttgcacat 12660 
aaatttgacc atcacacttt aaaacaaatt 12720 
tttgtccccg acaaaatgat gacaaaaatg 12780 
aacaggaagc gaaaagtagg agaagctctt 12840 
gctcactgtt tgcaacattg gcgttggtgc 12900 
gagacagaga gagagagaaa gagaaggaaa 12960 
aaagaggaag atgatgaaaa aatgaactct 13020 
gattacttca aaattgtaca ctaatcccca 13080 
aacaatggtt taagcaagct ttaaaaaatt 13140 
agtttagttt taagttttga tttcaaaaat 13200 
atatgcaaaa caatttttaa attcaaggac 13260 
tttttgtgat tccgaaaaag attaaagtag 13320 
acattacctc tatgaatttt ttttcatctc 13380 
agacatgaga tagggtgtaa aacgttccga 13440 
agagaaaaat gatatctcaa tggatttcat 13500 
cataagaaaa ggtacgttta tgactgtcta 13560 
tttcaaacat ttttaggaaa tattttcttg 13620 
gtttttgaca tattgtagat tttagagaag 13680 
gtgagaaaaa tccaagagac aaaagaatgg 13740 
caattattca tattcctact attcaatata 13800 
acctaattta tcatcccaca tatattccga 13860 
gtctaggaat tataaagggc cataattata 13920 
actaaccatt atgaaagtta aaaaccaatc 13980 
cgtagaagaa aagctttttt tttggtcgaa 1404 0 
tttagtgcaa ttatccattc aacgcaagtt 14100 
ataacaaaac agacaggtgt acaagtaaaa 14160 
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gaaaaataca agttttatcg taaaaactga tacgaatcta gatacacctg ttaaaaaagg 14220 
ctttctcgaa acccagatgc cgtacgaagt aagcagcagc caactaaaca ttttgagtaa 14280 
acatatggca agtgttttgg cgcaaattgt aaagattttc cgtgtgggta actagaattt 14340 
gaaactgtaa gtatgacgac ttaaccacac aaaatcaaat ttcaaaagat cttaaaatgt 14 400 
tcgaactttc aaaactttta agctctctcg catctaccgt agtcttctaa taacaacagt 144 60 
cgtaagagaa agctcaaaat ttttcaaact ttttctgaat gacagaatca gttgtataca 14520 
aaaaaaaccc ccaaaatgcg agccccatga acctgacaac cagacaagtc gaaattgtaa 14580 
aatcgtatag atcttggttc acgacatgaa gagcaccgcg ggggcacacg agagcaacta 14640 
ctgcaagcgc tcctgaagag aagaaacatc ttttttccag gaccactggc cagtagtgct 14700 
cccccagatc actttctttt ttcttgcttc atctgatttg tgtctgcgtc gtctgatctc 14760 
tttagaacct atccttcttc ttcttctttt tgatacttcg acatcagaac aacatcgaca 14820 
tgtatcatct tttctctttt ttttttgtta tctattcatt cattcacttt tcatttagtt 14880 
tgattaatag gtgacatgaa ctcttgtcac ttttcaattt caacttctta aatcttaaac 14940 
tcacagtgat tccagatatg agcaactcca atgaggtgtt gagtagaaac ctaaatataa 15000 
cattttggat gttttgataa tgttggaaca aataaattga aacaaacaag acttgaaata 15060 
gagacaacgt gcagaataat gtctaccagc tggtttcagt ggcatattgt accacgaacg 15120 
tccgacagaa cgaataacat aaagatcaag aaaaactgtt tgggagcaga caaacaatca 15180 
gaacacagtt ttgttgaggg gaccaaatca taattaatga ctaaatttta acgaagaaag 15240 
tgctcgaaaa gaacagaatt tagaagttga tgaacaatat ttttactttt agattaacaa 15300 
ttatgcttta caaatgacat ccaatctaaa gcatctggta atctgaaatt tgtcaaaaca 15360 
gctttcaaga ctagtttcaa atttgtcgat tcaatggatc aagtgtgtaa ttgatccaat 15420 
aaaaaagagt ataaagtgag aaggaagaaa gtgtgaaaaa agaagaacgt gaaacgtgca 154 80 
gaagatacga aatgagtttg aagactgcac ttttcgagcc tcgatggtca gtcacttggt 15540 
cagttgcgaa aaagctgtga aaatgataca ttgtgtcggc tctcgtagag aagaaagcca 15600 
catggtcagg atgactccaa ctgggatatt cagttgtaaa gaacacaatt gatatttttg 15660 
catctttttt aactagtttt tacaatatga gaaattgttc tgtgcgaaaa atatgacttc 15720 
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ttccttgttg ccgaagtgta tttccctgga aattccagta aatacctaat gtaaaaaatc 15780 
tcagcagaat gtgttcttac attttgttgt aataataatg tattaaaatt gcattaatta 15840 
aaaatttctt caaaatgttc ctacgtcttc tatgcacatt atttaggtca cagtttcatg 15900 
gagcacaaaa cacctgccga cgcctctaaa atagttataa ctgcgcatga aatcaggtag 15960 
aaaaaactac aaaataacca atacaaattg agtagggcga tggagaggtg ggcggttgga 16020 
gaggcgggca acaagcgtcc tcatgacgcc ttgttcattt agaatgtgtt tgctttgaat 16080 
tacatacaag tttctaaaat ttaacttaca aaatttaaaa aaagtcacaa caataataaa 16140 
agttgtggca atgaaatgtt ttaaaaatct aaatattgag ttttaaataa atgatttttg 16200 
aaaattcaca aagaaatgtt acaatctgtg aatgaagacg aacaatgaaa aagtgaggaa 16260 
cggacgcgga tattacacat tcagtcacac aataaacgtt cggacactac cacacatttc 16320 
tctcatcatt tttttccaaa gtttattcta aagttcaata ttttagtttg attattttgg 16380 
acactattct taaaattaat gtataatagt ttagaaaata ttttgaaaca tgaaactttt 16440 
ttgttgataa aatagtgcca aacatcctta tgttacgcag ttatccaacc acatttttct 16500 
catttttcca ccaaaaaaca ctgaaatggt ccataaaacc tattcaaatg gatatgagaa 16560 
tattactttt ttgacatgaa attttcaatg atgtaatgta aaacaaagaa aaatattgcg 16620 
ggaaaaattg aacggcgtat tgcaaaaatc ggtgtgcgga ggaggagaag gaaaaggaag 16680 
agcaggagaa gcggaccgaa gaattcagaa gcttttaaaa taagaacggc gactttcaga 16740 
caaacaatgg actgttgtat aaaaataaag cggaggcggt agagagtcaa agctttcaga 16800 
aatgtattag aataggtttc actacctgtt gttgaactca aaaaggtgtg aaaaagtgaa 16860 
agtttgtctg aagtttatga cgggaagtgt ccatcaaata actttcaaaa tttgacttat 16920 
cagtgagaaa aacacgtcat tttggaacgt taaaatgggt ggcaccgcaa aatgttcaca 16980 
atgtgaagtg aattacgtaa taaaatcagt tttattaagc ttattaaact aacccttccg 17040 
gactatttgt ggaatgaaac aattgggggg gttttttttt ccaattttcg attttttttt 17100 
gaatttataa ttaccggaac aaaaatatct ttaaattatt aagatttgag tgatgtttga 17160 
aattttgaac ctgcaaaaca taagcacaaa ataatggagt ttttgtttta aaatatcaat 17220 
aggtgttttt tcacagaact ttaaacaaca aatactcata atttgaatga aaacagtaga 17280 
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tcccacaata ttttgaaaac ttatctatat 
aaaacaaaaa gaaaaaaaca aataatttgt 
aaattgggca atatggtgaa gaaatacggt 
gtgttcctag tggaatgaaa ctaacagaag 
taaaacgaac agcggaagaa aagatctagt 
aatgggcacc tatcatcatc tcacttgttt 
tggcttctag ttcaagaaac gaaaaaaaga 
gaagagaaat atgagaaaat gatgatgata 
gaggaacgga aacgttatct ctgatcgcca 
aagttatggt atctttgaaa gaaaagaaaa 
ggagatttca ttcgaagaac ctaacccaat 
aaaacagtca atcacaggaa caggtgtctg 
atgacacaca atatcacaga caaaatgatc 
tatagaaaga gagggtgaac aggagataga 
tagaaaataa caatttttga acaacaaaga 
gaacttgcaa atttttagaa ttatcatata 
tataagtctt gcagtatttc ttaaatttaa 
tcaagaagtg ttcaggaaat tttaggacat 
ttatcaaaat tttagaggtc tagtacacat 
aaaaccacct agtgtatggt aaagtacatg 
tctttgtcga tatgcgggtg atggtgtttc 
tgctagggtg tcaaaccaaa tttatttcaa 
aattattcaa attctcgtaa tttaatgaat 
gagacaatga tcgaaagtta acaaaaaaaa 
gctgacattg gaacatatca aaaagtggta 
gaaattgaga gtagaaaaga tcattgaagc 
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atatatatat atatatataa ttacgaaaaa 17340 
cagttgataa tttttagata tgagttgcca 17400 
agttcgtcgc actgtcagac taattttcaa 174 60 
ctatacggta tataatatta ggaacacaat 17520 
ggtcacttcc gatttctcag ctgacttttg 17580 
atttgaacag tctcgacttt ttccaattgt 17640 
gcaataacgg aacagaaaat tcagaaagtg 17700 
ataataataa gttagaagag ggttatcgat 17760 
tctcattatt attatgagac acaaagatgt 17820 
caggaaatta tacagaacac acacaatttc 17880 
ttgaactcac tcccacttcc tcttgtctat 17940 
tcttttcaaa atgtatacgt tttccgaata 18000 
aatgaggttg cagaaaagaa tgcaaaaaaa 18060 
gaatcaaaat ttgcatagat aaatatgcaa 18120 
aataatttag tggcatataa tatagcgatg 18180 
aaaataacaa tgtttctata ttttatgccc 18240 
cagttcattt cttggtaatc tttattttta 18300 
caaattttta tttattttct aaatctactt 18360 
ctacccaaaa agaagacttt ggagctctca 18420 
agaagtgacg tgtctttggg cagctggcca 184 80 
tgtgagcagt aacaggaaat tctggacacc 18540 
cccattcttg cttcaaaaaa cccccaacta 18600 
cactcagtaa ctgtaacgtt ttttttttca 18660 
ctgaggatta aacgttattt ggtatctaca 18720 
agtgaaagtg aaacgaaaag tgcaacattt 18780 
agaaatatgg aagtgaattg aaagccgtgg 18840 
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cgccaaaacg acggtcaggc gccattgaga 
acacaaagac aacgtgaaaa attagtttgg 
tttcttagtt agctgagaat agtttacatt 
aatggaaaca tgtctagact tcagataaat 
actaacaaag gttacaaagc tattaggaaa 
tttctctaaa ataataacac cattaatttg 
ttcgcattac tgagcacaaa acttggaaaa 
gagaaatcat taaaaacagt acaatttcag 
ttttgagtaa gactaagtat gcactcacaa 
aaaatactag taaaaatgag catttcgaaa 
tcaattaaag gaatcaattt tataatagtt 
ctagcacagg gagcattgga agatataaaa 
taaacacatc atatctatca aaccatcact 
agtgtgtgac aaatataccg taatattaag 
ttttgcataa atactaaata attgaaagaa 
tatgtgtatt atcaaccatc aacagttttt 
agcaaattag gagacgcaga ttgagaaaaa 
attgtacaat aagacaggta gcatctctca 
caacaacgtt catttatttt ttagccttca 
ttttttaaat tcagatttct cattcatgta 
caaagtttaa ttatttgagc gcagaaaatt 
ctaacttcaa aaaatcctga ataaatacca 
tttacgtgcc tgcctacggt ctattttcta 
tcttcaaaaa tcttccttaa aaaatctcca 
tccaataaaa tactcttatg caaaaacccc 
aattaaagat aaaaagttat gatagacagg 



aaattaatga gagttcggaa ggttgaaaca 18900 
agaagataaa aaatgtctgg agatggacga 18960 
gattttcggg aaaacgcaga atgttagaaa 19020 
ttgtagaatt tatatttgta gcaaaagcac 19080 
aatacggaat gtatttttga aaatttttga 19140 
ctatatttgc tatatatgct atatagtatg 19200 
agtttaaaaa aaaaggaaac ttgttttctg 19260 
acagaaataa atctttcagt gaaagctttt 19320 
cttttctgag tgttccaaaa atgtttaaag 19380 
agcaatatat catacaacta cacaaacatt 19440 
ctaggcaatc ccacttttag attcaatttt 19500 
acataaagat aaaggtgata aaagatccat 19560 
tccatcaaat ccacagattt atcacaaatc 19620 
ttcaaatggt ggaaaagacg cagacaaagc 19680 
acgcagagaa tgtaagagaa aaatatacaa 1974 0 
gattaaaacc atggagaagc gatatacagg 19800 
atgagaaaat aatgaaagta cggaagggtt 19860 
aagaacctat tgtcaagcag tttaaacatt 19920 
ttatgatatc tcattggttc tataattgga 19980 
caagtaaagt tgttaattgg ttattatgcc 20040 
tgaatggaaa tttcagaaaa ctgattcatg 20100 
attcttttcc aagtatgatt ctcgagcctg 20160 
atttttttaa tgataaaatt ttagagtaga 20220 
aaaaaatcaa gttcaggaaa actaaagtac 20280 
ccattcattt tgcagaaaaa gacaaacaag 20340 
aagctgattt attagatcaa tgaatcgact 204 00 
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tttagttttt cttgaactct aatttgaaat 
aaagatcaaa agttataatt gaaaatcaac 
aaaatgtgcg gtacatgaga cagtagtagt 
tcggctactg ataatttcct tttttactga 
tgtttccaaa aattgtgggc gtggtttatg 
taaagtaaag aagaaaatga cagaggagaa 
ataaattaga ttattaaaga taatagttac 
tccaaaagtt caagtcctaa gaatcttgct 
'cctaaaaatg gcggacagct gtcgggacac 
caccgtctct tatcatcatg ccatttttcg 
ttaaaaagag agagtgtgag aaataagagt 
ttccaatttt taaaatgtat ttaaaagagt 
tggaaaaatc gtttttggat tattttttga 
.ctataaaaaa ctttgtgttc acttttttat 
aaatttttat gaatgtttta ttgtaaaatc 
ggattgcatt taaaagtgca ttagcagaaa 
gagatccaga agctcgaaaa acatccaaat 
aaaaaagatt ttagcccaat caactaaaaa 
ttctggtcac tttgaaggaa acactaatcc 
tcccatttac tatttcttgt tgtcttcaaa 
tttcgaaata ttgcggtttt cgttaaaacc 
ttcccaagag aacttgatga ccttcatcat 
gacaaaaatg gaaatgactg aatcggaaag 
aaattctgga aaacttttta aaatttaaga 
attttttagt gatggagaag tacgtgataa 
tgaatacact acgtgttttt atttatggta 



agtattcgaa tgagaaaatt gaaaatatac 204 60 
aaattgatag tgtttgtata ggattaaatt 20520 
agtagccata gtacgtattg gtggctccac 20580 
taatttgatg tcatttcgta attttatttg 20640 
aattggtcaa gacatgaatt aaaggaattg 20700 
attattttcg tttgctttgg aaattgcaaa 20760 
ggtttaaaat aaataggtga taaaaaaata 20820 
attttgcaaa aaaaaagcat gagcttttgg 20880 
tatccaagaa ttcgtgataa acgggtgaag 20940 
aattttaaac tcagactttg ataaagaaaa 21000 
acacatggaa aatgcaagat ttgaatttgt 21060 
taccgttcca tttttgatta gctttataag 21120 
ggaatatttt tgaatgcgct ttcaattttc 21180 
cccgttttta tttttatttt tacaactttc 21240 
ataaaaaggt gcgaaacatc taaattgcct 21300 
tgtattccta tggaatgttt tttgtgcaac 21360 
ttcttccaag aaagttgatg ttccaaaaat 21420 
aaaactctcg tttttttcat atttcacatt 21480 
caaactgaga accgaacatg gattaaacca 21540 
aagtcttaga attgtgcaaa aaatagaatg 21600 
ttttttgagt agattgaggg tccattagaa 21660 
caaaattagt ggtcattgaa tgtttgatca 21720 
agcaagaaaa tcgaaaaaaa aagtatttgg 21780 
agggcaacga taagaaacag gaaattaggg 21840 
ggttaaggtg gaacactagt gcacacgttt 21900 
gaatatagca cttaaagaac gtttttaata 21960 
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caaactgaaa taaaaatacg gaaatgtaat • tttttttttt gaaagaatcc gcctgaaact 22020 
gaattttcac atcaaacggt agtgattctc tttatgcgtt gggtgatatg tatttacgct 22080 
gtcttaaagt tttcgactat aatttaagta atatgtttgt caaaaatcat catggtgctg 22140 
tgtcctatgt agccttttct acacttgaaa aatgataatt tttatttgaa aatggtattt 22200 
aaattcaagt agaaagttat ttagtcttgt gtgccaagca ataaacacat agtctattag 22260 
gcaataaaaa gtcagctact gttt'gattta aaaacttaga ctactggtgt gcctgtgcaa 22320 
gttactcccg tagtacggat acagagtgaa aactagtgat tgtactttag atcggctgat 22380 
agtgaattta cagagaaata attataaaac ttaaaatttt tagcagctca gtcttcaggc 2244 0 
tgcacagcca tattgttaca cttggagtta caaattctgc aaaccatcta ggattgaatg 22500 
caaaaactct gaaagtcaca tcaagaaatt ccaacaaaaa acacattaga tgccaactca 22560 
ttgaattgca ttgattccca agagaaatag tagtaaaagt gacccctatc cattcctccg 22620 
ttacatacaa atatacacac aaaaaagagt gtagacctct tccttctaac ccaaccaaca 22680 
cacaacaata tcgttccctt ttatctctaa ttctctgcgt ctccataagc tttgagagct 22740 
cttcggagca tcttgtgctt gctccttgta cggcggtaca gtttcctccc tctgctccct 22800 
tatgtgtgtt taggtgttgt ttgaacaaat aagtttttgg ccatccacct ccttctcaaa 22860 
acctttttct tatgcttctt cttgttttgt gcacattttg gctcttgctt gtctgctcga 22920 
gccatagaca aggcggcgac atttttgaaa aaattatatt agtactgtta tatagtactt 22980 
aatacaacga tcacaacaac aacacaacga aatgaaaaca tgagatcaaa agacaaattg 23040 
ttaggaggag ttggagtttc tacaatcatg aaatgtttat ctagttatta taaaactgaa 23100 
attgctcata aaattgtgat accatgaaga ccgaaaaact ctatgcaact gcatactgca 23160 
catacttaca acctttattc tgacttgaat ttcagttttt ggtgtttgca gttattctat 23220 
tttgtttaaa agaaaattca attaggaaat aagcaataaa ttttggcatg tatttcgata 23280 
gaaggcacgt gtaaatgcca cccggaaatt agaaaaaata agatttctca aactgaaaat 23340 
gattgtgaat tgaaaattta agagaatcat tgcaaaagta cacaaatgaa tcatttttca 23400 
gattgaacag gaaagtgcag aaatatcaga ttaccgtccc aacagaaacc ggaaataaca 23460 
cttttcaggt aaagaattat acagaaatcg taataaattt aaaacaaaag agagttatga 23520 
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cacattgcag aacggtctct gtggaaaata ggaggaggtg ctgcaaaaac tccttagaca 23580 
tggtcatact tacaaaaaaa acagagttta actaaaaatt aaattaagtg agaaaatgaa 23640 
gaaaatggag gtctttcgcg gattcatttt acttcttctt ttttccactt ttcgttgcaa 23700 
gctttggttt aaaagtttcg caaacaaata aacaatgaac attgtgttga gaagacaagc 23760 
caagtgaaag gaaaccattg agagcaaaaa caacaatcaa ttgaaataaa gagtaaagtt 23820 
tattgaatat actgatatgt gaatactgga aaaataatta gtctctataa ttggtaccgc 23880 
ctggaagatt catttctgat tcccttgtgt ctttgaccaa aactttattt ttttcagttc 23940 
aaaattacaa aaaataaata ctcatcttca tcgattcagt ggtgttttaa actcctacgt 24000 
ttttctttta caataaaggt aatgtaaacg ttccgagcgt gtagttttct ctgaaaattt 24060 
tttaaaaata acaactttat ggtatttttc ttaaagtctt aaactgaaac cgaaacattt 24120 
ttgataggaa aactatttta acattttggg aactcggcaa aagctctgca ggcttgccga 24180 
acaactctca tttgaaagta ataaatatga aaataaatta tcgaagtttt tttttttgat 24240 
attttatgaa tacggctctt ggtagttttt gacgagaaaa ttacatgttg cataaatttc 24300 
aagagttata actcatggag accctaattt ctggtttcac tagaaaatca aaaaatcaag 24360 
cgtttgagca gaagactgta ggaagagcac acgtcataaa aattagggga tcaacgatcc 24420 
gaaacgggga attgaaatac gatatgcgat gagttttggt tcgaaccggc tttgtcccaa 24480 
aaaacaacag aacgatggtc tcaggctcac ttgactcatc tcggtgggaa caatttttat 24540 
ttgtttttat tccgtacgca cagaaacttt ttttgaggta tttttgatcg tgggtgggtg 24600 
gaatggtagc acccaatttc aaatagtgtt tgatttgaag agacaatgaa agaaacaagt 24 660 
gggagataat ggaaatgacg tgatgaaatg gaacggagga aaactggtat aaatatcgtt 24720 
gactatcaaa actacaataa tactaatgga gaaaagttca ggattcttga agattttaca 24780 
ttatgatagt tgggatttac tggtttcaag ttcaaatgtc aaacatctgg aagaaaaacg 24840 
tataagatta catcaaaata aaactaaaat ttgaaggata aagtaaaaca gcataatata 24 900 
gtgttttaca tctcatgtag gaaacgaaca aaatctttga acacctagat aacttcaaac 24 960 
ggaagttggg tgaagaaaag aataggggcc agaatagaag gtcattttga caaagtgaac 25020 
agacaaagac attcctaact cggaggtatt ccaaaaactg ttccaatatt gaagaatgac 25080 
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actatttgat tttatatcat aacattatta 
atcgcaaaat aaaaagtggc cttgatgagt 
ataattaatt ataaaaatgc tgatacttga 
gttttcttta ttccgttctc actttgagtt 
agttcggcgt tttttgtcga taattccatc 
caaatatttt agccagagtg aaatttatta 
gtattattct gttgaaggaa catgttgcat 
tgtttagaac tttataacct cgtttaaaca 
tacatgagtt tgaaacagta tcagatacta 
ctatgagaga ttctgataca ttggtttcca 
ctcccactga attacagatt tctgttctag 
ggcatcttcc tcatttttat cttcagtttc 
tgcatcctta ccatctctcc ctctctcact 
aaatgttttc tgccacgtca tcatcatcat 
aattaccacg gcatcaaaat attcgatatc 
cacttaacta ttttctatgc gttattattt 
acgaacgtta tggtggaaaa cctgaaaatg 
attcaaacga actgtcatta atctttctat 
tccttggatt tggtccaaat gactcaaaaa 
aaccaccacc cggatttgtt gataactcac 
aaaaaacttg atatgaagtg ttgaaaaatt 
ctagttatct tgtatttcaa atattaatat 
gtacttcagt aattaccacc cacattttga 
aacttttatt aggatgtttt ctcattttgg 
ttttatgcat caaattaata tctaattttt 
tttataagat tttctcgcta ttaaattagc 
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atcacatggc ttttttctta ggaaatttat 25140 
cattcattca aaacatgcct aaaaaccttc 25200 
ggacccgttt ttttatattt ataaacagtt 25260 
tttttctgaa aatactaaaa aaattaacaa 25320 
tgattatttt cggttttttt acctaattat 25380 
tcttattaat atgtttttca atttgttttg 25440 
tttaaatctg ttgttaatac agcggccaca 25500 
taaattgtat gccatattta ttgcaagtac 25560 
tattttaaac aaaaatacac attttccccg 25620 
atttttttaa aaacttgaaa ttcctcaagt 25680 
atacctccaa agacacctag attcgacttc 25740 
atcttttgtc taattttccg tacatttctt 25800 
cactcttctt gttcactaaa tctcaattca 25860 
caatgccacc ttctcagagc ccattcgaaa 25920 
acgaaaaatg cttctcaatt ccacttcata 25980 
tttatttctt tgttttcact atattttatc 26040 
ttcaagttac atcagcaatt tatgattcaa 26100 
ttgattcttc aattcgtcga cgggaaatat 26160 
catcaagaaa tgaaactcaa attgagctta 26220 
aaatttcagt aagtttagga ttttttttca 26280 
gataattggg ccgggcttac atcagagtat 26340 
tcaaacattg tagagattcg aaatgcgaca 26400 
ctgtcaaaaa agttcccaaa aattgtcgaa 264 60 
cacgattgga gtgttttttt aacaaatccc 26520 
aaatcaataa tttggattaa ttcaacttgt 26580 
aaaaaaaaac tatcttcaaa caattagcgt 26640 
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gctttaaaac tactaggcct ttgttggcaa 
aactatgctc agaatttggt aatgtttgaa 
attctaaagt aagattagtc atctattttc 
ttccattatt taacaaccaa gactgagcaa 
aaagacataa gcaaaagctg gatcaaagct 
cgtttccatt ccctcgtttg taaagtacaa 
tgggagattc ataatgtgaa aactaaatgt 
accccagtct ttaactgtga atgcagaaca 
aggacatcca tatttagaag taagttcaaa 
gttaccactt caagaaccat actccgattt 
ttatggagat tttgagatgg ttttaacagg 
aacatttaat ttttatagct ttaacatcat 
catatgaaga gcttactgga gagctttaca 
atgtcaggtt aactgcattt ccaatgatac 
aaaattatca tttgaaaatg atcaaactta 
taatgcataa aaaccgggcc aaattcatga 
cgatttcctc gacgctggaa aatcccaatt 
caccacgagt tgaaggatat gatattttag 
gcttaggcga cttttcggat ttctccatca 
aattaaccgg aaattctaaa acattaatca 
tttactgttt ctattttggc atttatttca 
acgcggtaaa atgtatccat ttttcaaaaa 
gaaaacattg gacaaattgg atacaataac 
gacacttatc aaagaatgtt cagaaaattc 
tgtaagacca tcagttagac atcgatctct 
ttctaaattt ttgaatgttg aaagtttttt 



cgtcttttca cattttggca caaaactata 26700 
aatgttttgg gcaagcatat agttattcca 26760 
cattccattt ttccattttt cacctatttt 26820 
acattttcct gttttaattt tcatatatga 26880 
tgggcaaatc ctattcaaag tattttccaa 26940 
ttggtaatct taaggcttaa ttaattattg 27000 
taagatttgg tcatcaattg aaaaggaaaa 27060 
tccaaagtca ttgcttttac gagatcacac 27120 
tcagaaatcc ccaatccatt ttttcttgta 27180 
tcgcgacatt gttagttgtt tcagtccaat 27240 
tttaacaaat taatttggtt tcttttttaa 27300 
ccatatcaat gggatcattt gttagtatac 27360 
agtttctacg tgtatttgaa aaaacgggac 27420 
gtcatcagcc tcgcttcgat tcggaaaatg 27480 
aaacagattt aacgcatttg cattgttggc 27540 
tcttccaaaa ctctgctgaa attgttttac 27600 
acgcctctga atttacacga atatttgaaa 27660 
aatataatgt caaaatttca acggataaac 27720 
ggcagacaat tgaagcagca aaagcagaag 27780 
tgagaatggt atcacttttt ttcaaaataa 27840 
gcattctcca actccacaga atctcttaaa 27900 
tttcccatct ccaccacaag ttattccaaa 27960 
agaaataatt gaagaatctg atgcattctg 28020 
gaaatcttgg aaatgctcgt caagaaaatg 28080 
tcatggatgg tattcatatg atattcattt 28140 
ttgttcagat tttcaataaa cttttaagaa 28200 
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aagaataatt ttaaattcta taattcctga atttccaact atgtttatca tttcccaaag 28260 
tacattcgaa aaagctcaat aagcaaaacg accacgaaat aacagtatta aaaaaaaaga 28320 
tgttgtcatt tgaagttctg gagtgcgatg aaaagtctct cacctcggac tttctgtaat 28380 
ttatttagca tacaacatga atttgaccaa ctcgaaataa ggttaagact gaaaattttt 28440 
cacaaaaatt ggaacacttg cgaagcgaat tcaagacttt tcgaagttat taaacaagct 28500 
ttcaaattct cagtaaaact gaacgttttt tttatgctct ccaaatcatt ttaatatggc 28560 
tgctcgcgtc gctgaagtat tttctagagt atgtttaata aaactaatat gtaaatgaaa 28620 
aaccaaaaac tcagataaag agcataactt ttataacgca ttttcagaac tcttcaagct 28680 
ttttcagatc acttctatca gcagtattct tcttttttcc aaagacacca agaactgaaa 28740 
aggttgaagg agcatcaccg gaaatagagg atgactgctt attgttcttc tttttctgaa 28800 
taaaatcaaa ttaaacaccg aaaatatgaa acatattcac taacctgaac agctttcagg 28860 
tttgatttat tctgattttc cgccgctgat ctgctctgac ttttgaaacc gggacttgga 28920 
gagttaccat tgcgtatgcg agttcgaact ggacgccgat tcttctttct gaataaacga 28980 
attatacaaa tttgtatttg aaaacggaca acatacactc cttcttccgc cgaattgctc 29040 
atcgattttc tcatttcttg tgttttttcc tggcgttcag gttcaaaagg tggagcaact 29100 
ggtttggaca tatacggaag aatgttcgag acttgaatct tttttggttg ctcaatattc 29160 
tccattggaa tatgatcggg aagttcaaag tagctgttgg atcctggagc ttgatcaaat 29220 
ccttcgagag ttaaaagttc acgaactgct tcactcattg tgaccctttc ctcttcggca 29280 
ccagcacaga ttctatactg aaattgcttg ttgtgttgtt ttactcaaaa gaatagtgaa 29340 
caaaattttc tcaccgtaat gaatctgaca atggctggtg ggacgttagc ttcaaatggc 29400 
attcggtatc cgttctgaac acgtggtaaa acctcagcaa ctttcattcc cggataaggt 29460 
tcgattccat catggtacac ttcccaacac atgactccat aagcgaaaac atcagtcttt 29520 
ggagtataga acccagttct tggaacttct ggagccaacc atctaatagg aactctgaaa 29580 
aatttgaaaa ggttggaatt tttgacgttc tctaactttt tgtgaggatt catccgatag 29640 
ctatagcctt ctcgtgacag tccaaagtcg gatatcttta cttgtccatt cccgtagaga 29700 
caatttctgg acgcaatatc gcgatgaatt atttgaagtg aatgaagata ttcaagacca 29760 
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agaccagctt 


gaagaaccat 


cgtatgtttc 


ttggaaattg 


gcaatgaacc 


aatgttcttc 




tttagatatg 


aatccaaagc 


tccattgtca 


gcctaaaata 


atttacataa 


gacatttttt 


29880 


cttagtaaaa 


taaaattaat 


cagttaatta 


attaacatac 


caactccatt 


atgaccatca 


29940 


aaggttcctg 


tcctgcagcc 


acaccataaa 


aagtgacgac 


attcggatgt 


ttgaactttc 


30000 


tcatcaatct 


ggcttcgtgc 


atgatttctt 


tgatctgctc 


ttttgtcaaa 


gattccaact 


30060 


ttgccagctt 


gattgcagct 


tttttgacgg 


tatttcctat 


gcgaatttct 


cccaattgaa 


30120 


cctctccaaa 


tgctccttct 


cctaatttct 


tgattaatgt 


cacgtcagaa 


tgttgctttt 


30180 


cccacggttc 


acgaccaatt 


ggacggatga 


ttacagtttc 


tgggccctaa 


aagcaaacaa 


30240 


atgaaaataa 


gtttactcac 


ttaatttgta 


agatcacccc 


agcaacaggt 


tctttagaac 


30300 


gatgatagta 


attgagaaga 


tctgcgatac 


tagaaaacca 


ttttttatca 


actgcaaact 


30360 


tgttattgtg 


ctctcgaatt 


acataatgac 


gaatctgaaa 


taatattctt 


aaaaattatg 


30420 


agcaatcgtt 


ttacgtacgt 


cctcaattac 


tccaacatag 


acagagagaa 


caaatttcct 


30480 


tggctctccc 


acttttggat 


cagtaaatcg 


aactagaaaa 


tcgcctcgtt 


gagtgagcaa 


30540 


ctgtttcata 


tcctcacgtg 


gcaataagcc 


atggtaccag 


ggttcttttg 


caagtacttg 


30600 



c 30601 

<210> 34 
<211> 8009 
<212> DNA 
<213> Unknown 

<220> 

<223> Description of Unknown Organism: Unknown 
<400> 34 

ggatccttgg ccacgccatg ggcgatgaaa ttgaccgcgt cgtaacgggt catgtcctgc 60 
tcctgcagga agaaggccgc gttcgattcc cgttccgcaa agatcgcgac aaggacattc 120 
gcccccgtca cctcggtccg gcccgagctt tgcacatgga tcgcggcgcg ctggatcacc 180 
cgctggaagg cggcggtcgg cacggcttcc gagccttcga cttcggtgat cagcgtcgag 240 
agatcatcgt cgatgaactc ggtcagggtg gtgcgcaact cgccaagatc gacgccgcag 300 
gcgcgcatca cgcggctggc gtcgggctcg tcgatcagcg cgacgagaag atgttcgagc 360 
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gtcgccagtt catgtttgcg cgtgttggcc 
agcgtggtcg aaaacgaagg catgcggcgc 
catgtgatta agtttcggtg gatttcgccg 
acctctcgcc gctctgttgc aaaactgacc 
cagcgcgcga aagtgctttc agaaccggtc 
gagcggcgcg gcatccccgg gcaggccgag 
caaaaaggga ttcgttgccc gttcctcgcc 
cagccgcaag gccgtcaccc ggtccatccg 
gctgagcgcg aaccggccgt tcgcggcggt 
gggcggcagc gcggccagac gggtcagcgt 
gagacgcccg cagccccagc tcatcaggct 
cagataccag gcgatatggc cgagcgtatg 
catccccaga tgcagcacgt cgcccggggc 
ggcatcggcc gcggcccccg ccaccttggc 
cgcgatgtga tcggcgtggt gatgggtgat 
ggtcagcacc ttcagcaccg gggccgcctc 
ggcggtgtcg tgccagagcc aggcgtaatt 
gagggtcatg gccttttgcg catctttcgc 
caacctgcaa tgcatctcga cgtgctcgac 
gggcgcacgg cgcaaaaggc gatccgcgac 
tccggcatgg ccgggctgac ggtggcgggc 
tatctgggcc gggcgcggcg ggtgatcggg 
tggcccgccg gagagcccaa tgtctcggtg 
accgggatga tcgaccggct ggtggtgctg 
gcgctgatgg aggaatgctg gcgcacgctg 
ccgaaccggg tcgggctttg ggcgccgcgc 



agcgccagtg cggcgtgaat tgcttgctcg 420 
tcctttcctc gggtctcccg atactggcct 480 
cgcttcaagg cccggacgcg tgttttttcc 540 
agcgcggcgg gctttcgcgc gatccgcagg 600 
cttgcgcgcc cgcgccgcgg tgaagacggc 660 
cgcggcgcgc aacgcagcgt catcggcgcg 720 
caaagtcacc ggcaaactgg gttccccggc 780 
gtcgtgcagc cgaccgttcc ccggttccag 840 
gtattcatgc cccgaacaga cccgggtttc 900 
gtcgaacatc tgcgcggggg tcccctcgaa 960 
gtcgccggaa aagagcagcc ccgccccggg 1020 
gccgtcggcc gcgatcacct gcgcggcctc 1080 
caccggatga tcgagcggcg gcagccggtg 1140 
cccggtcgcc tgcgccagcg cctcgacccc 1200 
caggatgtgg tgcagctgcc agcgccggtc 1260 
ggggacatcg accaccacca cggtatcggt 1320 
gtcggtcagg caggggatcg gggtcagttc 1380 
tatcctgacc cagcttcgcc caaggaaggc 1440 
ctgcgtgatt tctactaccg cacccaattg 1500 
aaggtggtcg aactctggcc ggacacccag 1560 
tacggcttcg cggtgccgct gttgcgcccc 1620 
ctgatgcccg cgcagcaggg cgtgatgccc 1680 
ctctgtgccg aaaccagctg gccgctggag 1740 
cacgggcttg aagtctccga cgaccccgat 1800 
ggccccggcg ggcgggcgct gttcatcgtg 1860 
gaaaccacgc ccttcggctt tggccgcccc 1920 
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tatacgatgg gccagctcga ggcgcaggca cgacgggtgg ggtttgcccc cgaacgtcag 1980 
gcggcggcgc tgtacattcc gccctcgcag cggcggttct ggctgcgctc ctccgagatg 2040 
tgggaacggc tgggcacaag ggcggcgggc tatctggcgg cgggggtggt gatgcttgag 2100 
gtgatcaagc aggtgcattc ggtgcgccgc tcggggcttg gcgcggcggt gcgcaagccg 2160 
ctctcgatcc ttgaaggggc gcccaagccg gtggtcgggc ggatgtgagc cgcccgcggc 2220 
cgcaagaatc gcccggccgg aaaagcccgt ttccgcggca cttcgccctg cggcggggaa 2280 
acgcagcggg gcgggcttcg accctttgcg ctaacactcc gtgccggtgc agaaaatgtg 2340 
ccagcctgat gcggattcct gccgccaaga tggttgcgag ggtcttgatg ctctgctaga 2400 
cgcaaccccg aatgcggcgt gcgagatcat tttgggcgcc gaggggggcc tctgaatcgg 24 60 
tgacggaacg attggttccg gtgtccgcgt gcggaggcaa aagcatcgga agggtggacg 2520 
tgtccgaacc agcttcgatt tccgcagcca ttgccgggcg ttatgccacg gccatcttcg 2580 
acctcgcgca ggaggccaag ggcatcgacg cgctctcggc cgacgtggac gcgctgacgg 2640 
ccgccttggc cggttcggcc gagctgcgtg acctgatttc ctcgccggtc tacacccgcg 2700 
aggagcaggg ggacgcgatc gccgcggtgg ctgcgaagat gggcctgtcg gcgccgcttg 2760 
ccaacggtct gaaactgatg gcgacgaagc gccgtctgtt cgcgctgccg cagctgctca 2820 
agggcctggc cgccgcgatc gccgaagcca agggcgagat gaccgcggat gtcacctcgg 2880 
ccaccgcgct gagcgcggcg caggccgaga agctggcggc gacgctggcg aaacagacgg 2940 
gcaagaccgt caaactgaac gtcgccgtcg atgaaagcct catcggtggc atgatcgtca 3000 
agctgggttc gcgcatgatc gacaccacgg tcaaagccaa actcgcttcc cttcagaacg 3060 
ccatgaaaga ggtcggataa atgggcatcc aagcagctga gatttctgcg atcctcaagg 3120 
agcagatcaa gaacttcggg caggatgccc aggtcgccga agtgggccgc gtgctctcgg 3180 
tcggtgacgg gatcgcgcgc gtgcacgggc tcgacaacgt ccaggcgggc gagatggtcg 3240 
aattccccgg cggcatccgc gggatggcgc tgaaccttga agtcgacaac gtcgggatcg 3300 
tgatcttcgg gtcggaccgc gac'atcaagg aaggcgacac cgtcaagcgc accaacgcca 3360 
tcgtggacgt tccggcgggc gaaggcctgc tgggccgcgt cgtggacggc cttggcaacc 3420 
cgatcgacgg caagggcccg atcgtggcga aagagcgtcg catcgccgac gtcaaagccc 3480 
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cgggcatcat tccgcggaaa tcggtgcatg agccgatggc gaccggcctc aagtcggtcg 354 0 
acgcgatgat cccgatcggc cgcggccagc gcgagctgat catcggcgac cgtcagaccg 3600 
gcaagaccgc gatcgcgctc gacaccattc tgaaccagaa gtcgtacaac gacgccaacc 3660 
cgggcaacaa gctgcactgc ttctatgtcg ccatcgggca gaagcgctcg accgtggcgc 3720 
agctggtgaa gaagctcgaa gaagccggcg cgatggaata caccaccgtc gtcgccgcga 3780 
ccgcttcgga cccggcgccg atgcagttcc ttgcccccta ttcggcgacc gcgatggcgg 3840 
aatacttccg cgacaacggc atgcacgcgc tgatcatcta tgatgacctc tcgaagcaag 3900 
ccgtggccta tcgtcagatg tcgctgctgc tgcgccgtcc gccggggcgt gaagcctatc 3960 
cgggcgacgt gttctatctg cactcgcgcc tgctggaacg ttcggcgaaa ctgaacgagg 4020 
atttcggttc gggctcgctg accgcgctgc cggtcatcga aacccagggc ggcgacgtgt 4 080 
cggccttcat cccgaccaac gtgatctcga tcaccgacgg tcagatcttc ctggaaaccg 4140 
aactgttcta ccagggcatc cgcccggccg tgaacaccgg tctctcggtg tcgcgcgtcg 4200 
gttcgtcggc ccagaccaac tcgatgaagt cggttgccgg tccggtgaaa ctggagcttg 4260 
cgcagtatcg cgaaatggcc gcctttgcgc agttcggttc cgaccttgac gccgcgacgc 4320 
aaaagctgct gaaccgcggt gcccgtctga ccgagctgat gaaacagccg caatattcgc 4380 
cgctgaccaa cgccgaaatc gtggcggtga tctttgcggg caccaacggc ttcctcgatg 4440 
ccgttccggt gaaggaagtc ggccggttcg agaaaggcct gctggcctat ctgcgctcga 4 500 
cccgcaagga cgtgcttgag tggct caeca aggaagaccc caagatcaag ggcgacgccg 4560 
agaagaagct caaagacgeg ategecgagt tcgccaagac ettegcttga eggectgaaa 4 620 
ggacagggag atgcccagcc ttaaggacct caagaacegg atcgtgagtg tcaagaacac 4680 
tegcaagate aegaaagega tgcagatggt cgcggcggcg aacattcgcc gcgcccagga 4740 
aagcgccgaa gctgcccggc cctatgccga geggatgaac gccgtgatgt cgagccttgc 4800 
cggtgcggtg ggctcgaccg acggtgcgcc gcgcctactt gegggcaegg gctccgacaa 4860 
ggtccatctc ctcgtcatca tgaegggega gegegggett tgeggegget tcaacgccaa 4 920 
tatcgegaaa etcgegaagg egaaggegat ggaactgctg gcccagggca agacggtgaa 4980 
gatcctcacc gteggcaaga aaggtcgega cgcgctgcgt cgtgatctgg gecagtatta 5040 
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catcgatcac atcgacctga gcgacgtgaa 
ttcgcaaaac atcatcgacc gcttcgaggc 
ctcggtcttc cagagcgtga tcagccaggt 
gttcgaaacc gatgcggcct cggcctcggc 
aatcctgacc gcgctgctgc cgcgtgcggt 
aaacaacgcg tccttcaacg gggcgcagat 
gggtgacatg atcgatcgct tgaccatcga 
caaagagctc atcgaaatca tctcgggcgc 
gagaatggca agcaaaggca aagtgaccca 
cgaagacggc ctcccggcga ttctgaacgc 
cgttctcgaa gtggcgcagc acctgggcga 
gaccgagggt ctcgtgcgcg gcgcggccgt 
ggtgggcaac gccaccctgg gccgcatcct 
cggtgacgtg tcgaaagccg aagcccgggc 
gcagtcgacg gaaagccaga tcctcgtcac 
ctattccaag ggcggcaaga tcggtctctt 
gatcatggaa ctgatcaaca acatcgcgaa 
cgttggcgaa cggacccgtg agggcaacga 
tatcaacctc gagaagctcg aagaatcgaa 
acccccgggg gcccgtgccc gcgtggcgct 
cgaccagtcg ggcaccgacg tgctgttctt 
cggttcggaa gtgtcggcgc tccttggccg 
gctggccacc gacatgggcg cgctgcaaga 
cacctcggtt caggccatct acgttccggc 
gtcctttgcc cacctcgacg .ccacgaccgt 
ctacccggcc gtcgacccgc tcgactccac 



gaaactgagc tacccggtgg cgcagaagat 5100 
gggcgaatac gatgtggcga cgatcttctt 5160 
gccgaccgcc aagcaggtga tcccggcgca 5220 
ggtttacgac tacgaaccgg gcgatcagga 5280 
ggccacggcg atctttgccg cgctgctgga 5340 
gtcggccatg gacaacgcca cccgcaacgc 5400 
gtataaccgc tcgcgtcagg ccgccatcac 54 60 
cgaggcgctc tgacggaacc ggagatagaa 5520 
ggtcatcggc gccgtcgtcg acgtgcagtt 5580 
ccttgaaacc accaacaacg. gcaagcgcct 564 0 
gaacaccgtc cgcaccatcg cgatggacgc 5700 
gtccgacacc ggcggcccga tcaccgttcc 5760 
gaacgtcatc ggcgagccgg tggacgaacg 5820 
gatccaccag cccgcgcccg atttcgcggc 5880 
cggcatcaag gtgatcgacc tgctcgcccc 5940 
cggcggcgcc ggtgtgggca agaccgttct 6000 
agtgcactcg ggcttctcgg tgttcgcggg 6060 
cctttaccac gagatgatcg aatcgggcgt 6120 
agtggcgctg gtctacggcc agatgaacga 6180 
gaccggcctg accctggcgg aacagttccg 6240 
cgtcgacaac atcttccgct tcacccaggc 6300 
tatcccctcg gccgtgggct accagccgac 6360 
acgcatcacc tcgaccaaag ccggttcgat 6420 
cgacgacctt accgacccgg ccccggccac 6480 
tctgtcgcgt gcgatctcgg aactcgggat 6540 
ctcgcggatc cttgacccgc aagtcgtcgg 6600 
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cgaagagcac 


tatcaggtcg 


cccgtgacgt 


gcaggacatc 


atcgccatcc 


tcggcatgga 


ggcccgcgcc 


cggaagatcc 


agcgcttcct 


caccggctcg 


gacggcgtgc 


aggttccgct 


ggttgcgggc 


gaatacgacc 


acctgccgga 


tgacgtgatc 


gcgaaagccc 


agcgcctcgc 


taccatgcag 


ttcgatctcg 


tgtcgccgga 


ggtccgtctt 


cccggcgtgg 


aaggcgatct 


cctctcgctg 


cgtcccggca 


tcctgaccgt 


cgtgaccggc 


ggcttcgccg 


aggtttcggg 


tctgacccgg 


gcggaactga 


ccgccgcggt 


agtcgcggac 


gccgcgcatc 


cgtcggtggc 


ggaagcgctt 


ggctcgcaca 


tcaatctctg 


cggtcatcgc 


gccggggccc 


ttgctttttg 


gaaggtgcag 


gcagccggag 


tgagcgacag 


atcggcgtgg 


cccagggatc 


gctggtgctg 


tggacgggcg 


agggcccgcg 


cgaattgcgc 


gagatcccgg 


cggtgcaggt 


gtcgctgtcg 


cgcatggaca 


tttccgccga 


catggtgacg 


tggggcgaca 


cccgcgtcgc 


ccgcgtccgc 


aatgacgacg 


actgggacgt 


ggcctgatcc 


cgatggtgcg 


cgcgactttc 


ccatccaacg 


ctggcagctc 


gtggtgatcc 


tgtgggggac 


gatcgagacc 


gtgtggcccg 


ggcctcgag 
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ccaagggatg ctgcaacgct acaagtcgct 6660 
cgaactgtcg gaagaagaca agctgacggt 6720 
gtcgcagccc ttcgacgtgg cgaaagtctt 6780 
cgaagacacc atcaagtcgt tcaaggcggt 6840 
agcggccttc tacatggtcg gcggcatcga 6900 
cgctgcggcg taagggggaa ccatggccga 6960 
acggcggctt gcctccgttg ccgcgagcga 7020 
gacggcgatg ccgggccatg cgcccgtcat 7080 
ggtcagcgcc gcgggcacgg ccgaatacgc 7140 
cgagaaggtg accgttctgg ccgagcgcgg 7200 
tcatgccgag atgctggccg aggccaagaa 7260 
cgatgccgcc gcgaagatgc tggccgacat 7320 
acgggacatc ccgccggata tctcgggccc 7380 
cttttgtctt gccgcgccgc atattagcgt 744 0 
gaacggatga agaagttttc ctcgacccgg 7500 
ttttcggatt atctggacgg cggcgtgatg 7560 
aggctggtgg tgttcgacga agccttccgc 7620 
atgtgggaca tcgaccagaa gcacaatccg 7680 
gccgagggct tcgtgatcgt ctttcgcacc 7740 
gcggactggc tggcgatcgg cggctgcgcc 7800 
cgcccggctt gactttccgc ccccccgcgc 7860 
aggcccgccc gtgcaacaag atgccccccg 7 920 
gaaatatccg gtcgccgaac tcaacgccct 7980 

8009 



<210> 35 
<211> 9810 
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<212> DNA 
<213> Unknown 

<220> 

<223> Description of Unknown Organism: Unknown 
<400> 35 

gatatcgggc ttgtcatttt cgattgcgac ggggttctgg ttgattcgga agttctggcc 60 
gtggccgtcc tcatcgcaga actggaccgg gcgggcgtgc gggtcgacga ggccttcgtg 120 
catcggcatt ttctgggccg gagcttcccg gctgttcagg aggtcgtgca gcgccagttc 180 
ggcgtgaccc tgcccgagac cttccaggtc gaggaacgtg cccggctgct gtcagccttc 24 0 
gagaccggcc tgcgggccat gctcggggcc gcggagaccg tccgcgcgct gtcggtgccc 300 
tactgcctcg ccacgtcgag cacgccggcc cggctcacgc gctcgctgga gatcacgggc 360 
cttgcggccc tcttcgaggg acgctgcttc accgcgagcc aggtggcgcg cggcaagccc 420 
gcgcccgatc tgttcctgct cgccgcggcc gagatgggcg tcgcgcccga acgctgcctc 480 
gtgatcgagg ataccgagcc cggcgtgcgc gcaggcctcg cggccgggat gcaggtctgg 54 0 
cgcttcaccg gcggtagcca tttcgcgaac cgatcccccg aggatgcgcc cgatgccctg 600 
ccgcaccggc ggttcgacag cttcgaccgt ttctacgaga ccctgcccgg cctgcgccgg 660 
gccaagtgcg agaccctgac atgatcgacc ggcccgagag cgagccgacg cccctcgacg 720 
atgccgcgcg cgcgggctgg ctctattatg tcgcaggcct gactcaggat cagatcgcgc 780 
gggagctcgg cacctcgcgt cagcgggcgc agcggctggt gagccgggcc atctccgaac 840 
ggctgatcca cgtccggctc gagcaccggg tctcgggctg cctgcatctg gaagccgcgc 900 
tcctccggcg cttcgggttg aagctggccc gcgtggcgcc gagtctcggg tccgaggtgg 960 
atcccctgcc ctccatcgcc cccaccgccg ccgccgaggt ggagcgggtg ctgcgctcgg 1020 
agcggccgat ggtggtggcc ttcggcaccg gccggtcgct gcgcgccacc gtcgaggaga 1080 
tgacctcgat ggtctgcgaa cagcacaaga tcgtgtcgct caacggaaat atttctgcgg 1140 
atggctcggc ctcctactac gatgtgatct tccgcatcgc cgaccgtgtg cgtgcgccgc 1200 
actatccgat gccgatgccg gtcatcgcgc aggatgcggc ggagcgggag ctgtttcatg 1260 
cgctaaagcc cgtgcagtcg gtgctgcggc ttgcgcgcaa tgccgatgtg accttcgtcg 1320 
ggctgggaca gatgggcgag gacgcgccgc tcctgaagga cgggttcatc acgcccgagg 1380 
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agctgaccga gatgcaggat ctgggcgccg 
cggagggtcg ctacctcgaa accagcatca 
tttccgagga tcggacggtg gtcgccatcg 
acgcaggctt aaggggccgt cttttcaacg 
cacttctgtc ctgaagccgc cgaaaggcgc 
tcggtgagta attattcgcc gcacgaaata 
gatgaccgca agatttcgcg ccctgatggg 
cgccggcgcc gaaaccatca ccgtggcgac 
ggggctcatg tccgagttca acgcgcagca 
cgaggaaaac gtgctgcgcc agaaggtcac 
cgacgtgctg accatcggca cctacgaggt 
gagcctgaac gacctgccgc cggagtatga 
cggcctgacc gtcgacggcg agctctatgc 
catgtatcgc aaggacctga tggagaaggc 
ggacttcgtg aaggaagcgg cgcagaagat 
ctgcctgcgc ggcaaggccg gctggggcga 
cagctacggc gcgcgctggt tcgacgagaa 
gaaggccacg ctgaccgact atctcgacat 
gaaaaacggc ttcaacgaga acctcgcgct 
cgacgcgacg gtggccgcct ccttcgtgac 
ggtgggcttc gcgctcgccc ccgataccgg 
ctggaacctc gcgatcccgg cgggctcgca 
ctgggcgacc tcgaaggact atgccgagct 
gcctccgggg acgcggacgt cgctctacga 
gaagatgacg ctcgacagca tcaacgcggc 
gccttacgtc ggtgtgcagt tcgtggcaat 
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tcggagaggt ggcgggatgg gtcttcgact 1440 
atcagcgggt tgcgggcgtc cgtgtcgaac 1500 
ccggtggcag gcgcaagctc gcggcgctgc 1560 
gcctgatcac cgacgagttc acggcgcagg 1620 
ggcaaaaagt atttgacagg ctggcacccc 1680 
atgctcaccg tgcaggccag ggaggatact 1740 
cgcgtgcgcc gtggctgcgc tctcgtccgc 1800 
tgtcaacaac ggcgacatga tccgcatgca 1860 
ccccgacatc accgtcgagt gggtgacgct 1920 
gaccgacatc gccaccaagg gcgggcagtt 1980 
tccgatctgg ggcaagcagg gctggctcgt 2040 
tgccgacgac atcctgcccg cgatccgcaa 2100 
cgcgcccttc tacggcgaga gctcgatgat 2160 
ggggctgacc atgcccgacg cccccacctg 2220 
gaccgacaag gatgccgagg tctacggcat 2280 
gaacatggcc ttcctcagcg ccatggccaa 2340 
ctggcagccg cagttcgacg gcgaggcctg 2400 
gatgacgaac tacggcccgc ccggcgcctc 2460 
gttccagcag ggcaagtgcg gcatgtggat 2520 
caaccccgag gaatccacgg tggccgacaa 2580 
caagggcaag cgggccaact ggctcggggc 2640 
gaaggtcgat gccgccaagc agttcatcgc 2700 
ggtggcctcg aaggaaggct gggccaacgt 2760 
gaacccggaa tatcagaagg tgccgttcgc 2820 
tgacccgacc cacccggcgg tcgatccggt 2880 
ccccgagttc cagggcatcg gcaccgccgt 2940 
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gggccagcag ttctcggcag ccctcgcggg 
ggcccagcag ttcacgacgc gcgaaatgac 
gcgggccggc cctgagcggc cggcccgcac 
ggcctttccg ccccatcagc cccgaggcct 
cgcgtctgat gatttccccg gccgtgatcc 
cgatgacgct ctacttcagc ttcctgcgct 
tcaccggctg ggacaattac tattacttcc 
ccaacacgat cctcctcgtg gtcggcgtcc 
tcgcgctcct gctcgaccag cccttctggg 
ctcccttctt cgtcatgccc accgtctcgg 
ccgtgaacgg gatgttcgcc catatcgccc 
tgtcgcaggc gccgctggcc tcgatcatcg 
ccacgctgat ccttctgacg gcgctccagt 
agatggacgg cgcctcggcg ctcgaccggt 
gtgccatcac cgtggtggtg ctgatccaga 
tcctcgtcac gacgaacggt ggacccggca 
atgcgcagtc gctcctgaat tacgacgtgg 
tggtgctcgc caatatcgtg gcgatcttcc 
cctgacatgt cacgccgcac ctcaacccgc 
acgatagcct tcctcatctt cttcccgatc 
gaaggagacg ccatcaaggc gcccttcgcc 
tacgccgatg tgcaggaacg gtcgaactac 
tcgctgggct cgaccctcgt ggcgctcgcc 
ttcgtgccgg gccggcggac gaaggacgtg 
ccggcggtgg gcgtgctcat cccgctctat 
acgcggatcg gcctcgtgat cgtgctcacg 



ctcgatgtcg gccgagcagg cgcttcaggc 3000 
ccgcgcgggc tacatcaagt gagcccttcc 3060 
cgcttgccgc ttccggccgt atccgccgga 3120 
ccatggcgac ccagcattca aagactgcgg 3180 
tcctgttcct gtggatgatc gtgccgctgt 3240 
acaacctcct catgccgggg atggagagct 3300 
tgaccgatcc ggccttctcc gcggccctga 3360 
ttctcatcac cgtggtgggc ggggtcctgc 3420 
ggcagggcat cgtgcgcgtg ctggtgatcg 3480 
cgctggtctg gaagaacatg ttcatgaacc 3540 
gcgggctcgg ccttccgccg ttcgacttcc 3600 
gcatcgtggc ctggcagtgg ctgcccttcg 3660 
cgctcgaccg cgagcagatg gaggcggccg 3720 
tcatccacat caccgtgccg cacctgacgc 3780 
ccatcttcct tctgggcgtc ttcgccgaga 3840 
ccgcctcgac caacatcacc tacctcgtct 3900 
ggggcgggtc ggccggcggc atcgtcgccg 3960 
tgatgcgcat gatcggcaag aatctggacg 4020 
cgcacgctga tcgtcacgct cgccgcctgg 4080 
ctctggacgg tgctgatgag cttcaaatcg 4140 
atgctcttct cggactggac cctgcaatcc 4200 
gcccgccact tcatgaattc ggtggtgatc 4260 
atcgcgatcc ccgccgcctg ggccatggcc 4320 
ctgatgtgga tgctgtcgac caagatgatg 4380 
ctgatcttcc gcgacacggg ccttctcgac 44 40 
ctcatcaacc tgccgatcgt ggtctggatg 4500 
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ctctacacct acttcaagga gatcccgggc 
gcgacgctcg gctccgagat cctctatatc 
tcgacgctgc ttctgaacgt gatcctcgcc 
accacctcgc gggcggcccc gctcacgcag 
ctcttctacg ccaaactgtc ggcggcctcg 
ggctggttca gccagaaaca actcgtccgc 
ccacatgggc aagataaccc tgcgcaacgt 
cccctcgctc gacctcgaca tcgaggatgg 
ctgcggcaaa tccacgctcc tgcgcctgat 
gatcatgatc gacgggcgcg acgccaccga 
ggtgtttcag agctacgcgc tctatccgca 
gctgcggatg gcgaagatgg agccacagga 
gatcctgaac ctcaccaact atctcgaccg 
gcagcgggtg gccatcgggc gcgccatcgt 
gccgctctcg aacctcgatg cggcgctgcg 
gcaccagtcg ctcgagacca cgatgatcta 
catggccgac aagatcgtgg tgctgaacgc 
caccctctac cgcaatccgg cgaacctctt 
gaacctgatc gaggggcccg aggccgccaa 
cgaacatatc gacctgtcgc gcgaggcggg 
acatctcggc tcggacacgt tcctgcatgt 
gcggacgggc ggagagttcg gcgtccatca 
cgacaagatc caccgcttcg gcgccgacgg 
agaccgccct catcaccggc tcggcgcgcg 
tgcgtgaagg cgcgcgcgtg gccatcgccg 
cggccgagat cggccccgcg gcctgcgcca 



gagatcctcg aggcggcgcg gatggacggg 4 560 
ctcacgccga tggccgtgcc gggcatcgcc 4 620 
tggaacgagg ccttctggac gctgcagctg 4 680 
ttcatcgcga gctattccag ccccgagggc 4740 
accatggcca tcgcgccgat cctgatcctt 4800 
ggcctgacct tcggcgcggt gaagtgagga 4860 
ccagaagcgg ttcggtgagg cggtcgtcat 4 920 
cgagttcgtc gtcttcgtcg gcccctcggg 4 980 
cgcgggcctc gaggatgtgt cggacggcca 5040 
gatgccgccc gcgaagcgcg gcctcgccat 5100 
catgacggtg aagaagaaca tcgccttccc 5160 
gatcgagcgg cgcgtgtcga acgcggccaa 5220 
ccgccccggc cagctctcgg gcgggcaacg 5280 
gcgcgagccg gcggccttcc tgttcgacga 5340 
ggtcaacatg cggctcgaga tcaccgagct 5400 
tgtcacccac gatcaggtcg aggccatgac 54 60 
gggccggatc gagcaggtgg gctcgcccct 5520 
cgtggcgggc ttcatcggca gcccgaagat 5580 
gcacggcgcc accaccatcg ggatccgccc 5640 
ggcgtgggag ggcgaggtcg gcgtctcgga 5700 
gcatgtcgcg gggatgccca ccctcaccgt 5760 
cggcgaccgg gtctggctca cgccgcaggc 5820 
aaaggcgctc tgacatgcgg ctcgacggca 5880 
gcataggccg cgccttcgcc gaggcctatg 5940 
acatcaacct cgaggcagcc cgcgccaccg 6000 
tcgccctcga cgtgaccgat caggccagca 6060 
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tcgaccgctg cgtggccgag cttctcgacc 
atgcggccct cttcgatctg gcgcccatcg 
tgttcgcgat caacgtctcg ggcacgctct 
tcgcgggcgg ccggggcggc aagatcatca 
aggcgctggt gggcgtctat tgcgcgacca 
cggggctgaa cctcatccgc cacgggatca 
acggcgagca ctgggacggg gtggatgcga 
gcgagaagaa gcgtcaggtc ggcgcggcgg 
acctgaccgg catggcgatc ttcctcgcca 
cctacaacgt ggacggcggc aactggatga 
tcgaacgcgc cccctatccg aggtaatccc 
gaccgcaagg cgctcactcc cggcatcgtc 
catcaggcgg tctatctcga cgatctcttc 
ctcggcgcgg gcgtccgccc gaccgatgcg 
aatctctcga cggtgatcga gctcgatccg 
atggtgggct tcctgccggt cgaggccgac 
ccgcgcatcc gcatcgtctc gctgaccgtg 
ggcgccttcg atccgacgca tcccgatatc 
gcgaccgcct tcggcgcgat cctcgccgcc 
cccttcaccg tgatgtcctg cgacaacctc 
gtggtgggcc tggccgagct ctacgacgcc 
gccttcccga acggcatggt cgaccgcatc 
ctggcgcagg gcttcggcct cgccgatccg 
tgggtgatcg aggatcattt ccccgccgga 
ttcaccccgc atgtccatgc ctacgaggcg 
gcggtgatcg cctatccgtc ggcgctcatg 



gctggggcag catcgacatc ctcgtgaaca 6120 
tcgagatcac ccgcgagagc tacgaccggc 6180 
tcatgatgca ggcggtggca cgggcgatga 6240 
acatggcaag ccaggccggc cgccgcggcg 6300 
aggccgccgt catctcgctc acccagagcg 6360 
acgtcaatgc catcgccccg ggcgtggtgg 6420 
agttcgccga ctacgagaac ctgccccgcg 64 80 
tgcccttcgg ccgcatgggc cgcgccgagg 6540 
cgcccgaggc cgactacatc gtggcccaga 6600 
gctgaggccc aaggcccggc cctccccccg 6660 
atgacccgct ccgtcacccg tccctcctat 6720 
catatcggcg tcggcaactt ccaccgggcg 6780 
gcgctgggcg agggccacga ctgggccatc 6840 
cggatgcgcg aggctctggc cgcgcaggac 6900 
gcgggccacc gggcccggca ggtgggggcg 6960 
aatgcggccc tgatcgaggc catgtcggat 7020 
accgagggcg gctattatgt cgatgcctcg 7080 
gtggccgatg cggcccatcc tgcgcggccc 7140 
ctccgcgccc gccgcgacgc gggggttaca 7200 
cccggcaacg gccatgtcac ccgcaacgcc 7260 
gagcttgcgg gctgggtgaa ggcgcaggtg 7320 
acccccgcca ccggcccgca cgagcgcgaa 7380 
gtgcccgtca cctgcgagcc gttccggcag 7440 
cgccccgcgc tcgagaaggt gggcgtgacc 7500 
atgaagatcc gcatcctgaa cgggggccat 7560 
gacatccagc tcgtgcacgc ggccatggcc 7620 
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catccgctga tcgcggcctt cctgcacaag 
ccgcccgtgc ccgacaccag catccccgac 
aaccccgaga tcgccgacac gacgcgcagg 
aagttcatcg tgccgtcgct gcgcgacaat 
gtgctgctct cggcgctctg gtgccgctac 
gtcgagccga acgatccgaa ctggacggcg 
acgccggccg agtggctggc gatgaccgaa 
ctggcggccg agttcgcggc agccctcgag 
ctgcggcgct tcctcgcggc ctgatccgca 
gcccctgccc cttgcggcgc accgtgaggc 
ctcggtaaca ccatggtatc gcgcaagaat 
ccgggcgagg cgccggactt gtcaaggcgg 
cccgttagca cagtggtagt gcagcgctct 
acacggggca cgcgatcctc cctccgcatc 
cggaaggcac ctgcccgctt gtgcgccgcg 
ggcttgcgga aagccgtgcg cctcggttct 
ctcagcgcgc agccggcgga cagaagccgg 
ggatcgtcgt tcattcatca ccaaggccgc 
cgcgccggcg cttgcccagt ccgcgcccaa 
gaaatcgctc gacacgatct tcggcggcgc 
caccgacggc aacttccaga tccaggtctt 
ggccgccgac gccgtgaccg agggcaccgt 
ctggggcaag gatcccacat tcgcgctggc 
cggcatcaac gcctggcact accatggcgg 
gcagcacaac atcgtggcct tcccgggcgg 
ccggcgcgag atcaacaccg tggccgacat 
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gtcgaggtcg aggagatcct gccccatgtc 7680 
tatcttaccc tgatcgagag ccgcttctcg 7740 
ctctgcctcg acggttcgaa ccggcagccg 7800 
ctggcggcgg gcacggtgcc gaaggggctg 7860 
tgcttcggca cgacggactc gggcgttgtg 7920 
ctgcaggacc gggcgcggcg ggcgaaggag 7980 
gtctacggcg atctggcgca gaacgatctt 8040 
gcggtctggc gcgacggggc cgaggcggtg 8100 
gggcccagcc gctcggagca ccgaagcgga 8160 
gaaacgaccg ggccaccccg gggccaccgc 8220 
gccggcgcct ctgccgaacg ggcccggctg 8280 
cggccctcgg gtagagaggg cgggcgtggc 8340 
tgtaaagcga aggtcgttcg ttcaaatcgg 8400 
ggcgctcgcc cccggtctgg actgcctctt 84 60 
ccctttcctc gcttcccaag cgtctgtcac 8520 
ggacagccgc cccttgcggt gtaatctgcc 8580 
cccgccacgt ccacaaggga ggaatgccat 8640 
cgtgggaggg gccgccgcga gcgccctcgc 8700 
ggtcacctgg aggctcgcct cctccttccc 8760 
cgaagtgctg tcgaagatgc tctccgaggc 8820 
ctcggcgggc gagctggtgc cgggcctgca 8880 
cgaatgctgc cacacggtcg gctactatta 8940 
cgcggccgtg cccttctcgc tgtcggcgcg 9000 
cgggatcgac ctctacaacg atttcctcgc 9060 
caacaccggc gtgcagatgg gcggctggtt 9120 
gcagggcctg aagatgcggg tcggcggctt 9180 
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tgcggggaag gtgatggagc gtctgggcgt cgtgccgcag cagatcgcgg gcggcgacat 9240 

ctatccggcg ctggagaagg ggacgatcga cgcgaccgaa tgggtcggcc cctatgacga 9300 

cgagaagctc ggcttcttca aggtggcgcc ctactactac tatcccggct ggtgggaagg 9360 

cggcccgacc gtccatttca tgttcaacaa gagcgcctac gaggggctga ccccggccta 9420 

tcagtcgctg ctgcgcaccg cctgccacgc ggccgatgcg aacatgctcc agctctacga 94 80 

ctggaagaac ccgacggcga tcaagtcgct ggtggcgcag ggaacccagc tcaggccctt 9540 

cagccccgag atcctgcagg cctgtttcga ggccgcgaac gaggtctatg ccgagatgga 9600 

agcctcgaac cccgccttca agaagatctg ggactcgatc aaggccttcc gctccgagca 9660 

ctacacctgg gcgcagatcg ccgaatacaa ctacgacacc ttcatgatgg tgcagcagaa 9720 

cgccggcaag ctctgagccc gagcgccgcg cgaaagagga ccccggagcc gcgttccggg 9780 

gtcttttcat gggcgacagg ggccggcgcg 9810 

<210> 36 
<211> 1886 
<212> DNA 
<213> Unknown 

<220> 

<223> Description of Unknown Organism: Unknown 
<400> 36 

tgagtgtcta ttttttttcg ggttttttta agtgtgaatc acatggttag gagcagttgt 60 
cttcaatgtg accaaccatc ccaaggctct aattcaacgt ttgggtgtgg gggcccgctg 120 
gcagctgtgt gtgccactgg gctgttggtg ttggtgcttt actccccctc atcgcaaacg 180 
gctaattggt cggcacaggg tatttccaca aaggcgctgt atccggcagt gcctgtgcct 240 
tccactctgc tgcctggaag cgcgcctgcc aaacaccagc tgcatgtttg gagggcacat 300 
gcgatgtcgg aggccacaac aaacaattca ttcaaacagt cattatttgg gtacaatgcc 360 
atctcctcca tttggcttca actggctggt gtggccgcca ctttctttgc atttggagct 420 
ttgatggcag ctgtaacgca acgcaaggag atcgccgtct tctccgcctc gggtcaggct 480 
gctgagccgg agggggcgga gcccctgaag cggccttttc cgtctcctgc tgccaaacct 540 
aagccgctct tctccacccc ggcaaattcc ttcagcaaca tcttccaggc gcctccatcg 600 
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ctgcgcacgg actccaccta tggccgaggc 
aactggccct ccaacaacgc actccgcaac 
gtcgacttcc tgggggacag aagccctgga 
agctccaccc tcagcaacct cggcatgggc 
ttcggcaagg gcttcggcaa aggccggggg 
caggaggtgc tgtcgtgggc cgaccgccgg 
atgaacgagg acatcaagta cccgcagctg 
gacgagaagc tcggtgtgat gtcgaggcag 
atcgacctcg tcctcgtcag catcgacacc 
tcgaagttga agtacgagtc cgagaagaag 
aaggaggtga aggagctgaa ggtgtcccat 
gtgaagcagg cccgaaagtt cctggagggc 
aaggggcgcg agaaccagtt cgtggagatc 
gacctggcgg acatgggcaa ggcggacgcc 
ctgaacctgg ccccggccgg ggaggcgctg 
gacaggaaag ccgcggctga ggaggagggg 
aacgaggacg aggatgtgga gggggagggc 
gagacagcgg aggggacgga ggtgccaacc 
ccacctcccc cccggcctcc ctgccggggg 
tgccatcaac ggccgtcctt gagcttaatg 
ccaggttgct ggcctggctg cccgcagctc 
accgtgatgt gcgccttttc gccttc 

<210> 37 

<211> 3015 

<212> DNA 

<213> Unknown 
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ccgcgctcga ccagcttcac cgacatcagc 660 
ccccagtcgg tgattgacat cgggggagga 720 
aacccgttca cgcggctgcg ggggtccccg 780 
ctaggcctgg ggctgggcaa gggcaagggc 840 
ttccccgtgg aggaggaggt ggaggaggag 900 
cgggcgctgg cggaccccga cgccccgccg 960 
cggctggtgc gggccgtgcc gggcggccgg 1020 
gaggcgctgg agctggcgga ggcggaagac 1080 
gaccccccgg tggccaagct agtcaattac 1140 
aagaaggaca gccacaagaa ggggaaggtg 1200 
aagatcggcc agcacgacta cgacgtccgc 1260 
ggccaccgca tcaaggtgtc gatggagttc 1320 
ggccgcgcgg tgatgaagcg cttccagaac 1380 
gtgcccaaga agctcggcac ccggctgatc 1440 
aaggtgattg cggagcggag ggcagagcgc 1500 
gagggcgacg acctcgactt cgtggacgag 1560 
gaggaggaag aggccgagga gctggaggag 1620 
cgcagctgat cgccgatccg cgggggacag 1680 
ccggcaccat ccgtcgttgc ggtgcggcgc 1740 
ctcccgccct ccgttggccc gcggcggtcg 1800 
ctcccctccc cgactgacac agtgtggatg 1860 

1886 



<220> 
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<223> Description of Unknown Organism: Unknown 
<400> 37 

ccgctcatct ccaggcctcc ctgagtgcgt acccgagagc ggcaagtaga gaaaggaaca 60 
cagatacagc accatggcct ctaggctcgt ccgtgtgctg gcggccgcca tgctggttgc 120 
agcggccgtg tcggtcgacg cgcgcttcgt ggtgcgcatg gtgcaggtgg tacaccgcca 180 
cggtgcgcgc agcgcactca tcgacgacaa cacgacggag atttgtggca ccctgtaccc 240 
gtgcggtgag ctgaccggcg agggtgtcga gatggtccgt gctatcggcg agtttgcccg 300 
cagccgctac aacaacctct cattggtgga gagccctctc ttcccgtcga cgcggtacaa 360 
ctcctctgtc gtgcacacac gctccaccca cacccagcgc accatccaga gcgcgaccgc 420 
ctttctgcgc ggcctcttcc aggacgacta cttctacccg gtggtgtact cgaccaacag 480 
aacgaccgaa acgctgctca gcactgacgc ggtgccgtcc gtggtgggcc gtagctggct 540 
cgacaacccg gcgctgcacg ccgccctcaa cccggtgatc gatgagcacc tcagctggga 600 
cgccatccag agcgctgcca aggacgcatg ggtcgagggc ctgtgcgcgg actacaacgc 660 
ccgcaccaac tgcgtcctcg acatgtacga cgtggccgcc gccttcgagg ccgccgggcg 720 
tcttgacaat gccaccaatc tcaaggcggt gtatcccggc cttcaggagg tgaacgccgc 780 
ctggttcaag tatgtcttca gctggaacca cacgagcaag ctcgatctca cgcagggctc 840 
cgcctcgcag aaccttgcgc agacggtgct ggccaacatc aacgcccacc gcctctctcc 900 
gtcgtacaac atgttccagt acagcgctca cgacacaacg gtgactccct tggctgtcac 960 
gttcggtgac cagggcgaga cgacgatgcg tccgcccttc gcggttacca tcttcgtgga 1020 
gctgctccag gacaccgcag atgccagtgg ctggtacgtg cgcctcatcc gcggcaaccc 1080 
tgtgaaggca gccgacggca cctatgtctt ccaggagtct ggtatcaagg catactgcat 1140 
cgatgaagcc gggaacaagt acctcgcaca caccggcatc tgcccgctga atagcttccg 1200 
ccgcatggtc gactactcgc gccccgccgt ggctgacggt cactgcgcca tgacacagac 1260 
tcagtacagc aacatggatt gcccgcgcac tatcgcggac aacaagccgg tgccgtcgcg 1320 
ctgctggctc taccgccacg tttgccctag caaggcatgc ccggacagct acattctctc 1380 
cgcggtcgac caccagtgct accccgggcc cgacgttacg aaccccacca gcagcagcag 1440 
cagcgagggt accaccacca gcagcagcga gggtaccgcc accagcagca gcgacgttac 1500 
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caccaccagc agcagcgagg gtaccgccac 
cgagggtacc gccaccagca gcagcgacgc 
caccagcagc agcgagggta ccaccagcag 
cgctaccacc accagtagca gcgagggtac 
cagcagcgag ggtaccgcca ccagcagcag 
cgccaccagc agcagcgacg ctaccaccac 
cagcgacgct accaccagca gcagcgaggg 
cagcagcagc agcgagggta ccaccagcag 
cgttaccacc accagcagca gcagcgaggg 
cagcagcagc gagggtaccg ccaccaccag 
gggtaccacc agcagcagca gcgacgctac 
cagcagcgac gctaccacca gcagcagcag 
taccaccagc agcagcgagg gtaccgccac 
cagcgagggt accaccagca gcagcagcga 
caccaccagc agcgacgcta ccaccagcag 
cgacgctacc accaccagca gcgacgttac 
caccagcagc agcgacgcta ccaccaccag 
cgagggtacc accaccagca gcagcagcag 
cccttccttc aaaaagcccg cgaactggag 
ccacattgcc ggggacatca tccgccgcgt 
ccgaaagcac gatgagtaca gccggcaccg 
ttgtgcatgt ttgaatggag aggaggcttc 
cacaaccgct cgtttcaccc ccgtcatctt 
ctttgtgggc tctctctggg tgtgggcgct 
ccatcgctct tacctgtggg ctattttttt 
cgcgtccctc tgcag 



-152- 

cagcagcagc gacgctacca ccagcagcag 1560 
taccaccagc agcagcagcg acgctaccac 1620 
cagcagcgac gctaccacca gcagcagcga 1680 
cgccaccagc agcagcgacg ctaccaccac 1740 
cgacgttacc accaccagca gcgagggtac 1800 
cagcagcagc gagggtacca ccagcagcag 1860 
taccgccacc accagcagcg acgctaccac 1920 
cagcagcgac gctaccacca gcagcagcga 1980 
taccgccacc agcagcagcg acgctaccac 204 0 
cagcgacgct accaccagca gcagcagcga 2100 
caccagcagc agcgagggta ccgccaccac 2160 
cgagggtacc accagcagca gaagtgacgc 2220 
caccagcagc gacgctacca ccagcagcag 2280 
cgctaccacc agcagcagcg agggtaccgc 2340 
cagcagcgag ggtaccacca gcagcagcag 2400 
caccaccagc agcagcagcg agggtaccgc 24 60 
cagcgacgtt accaccacca gcagcagcag 2520 
cagcagcaaa agcacaagtt catcggatgt 2580 
cccgcgcgtt ctctcgccgg aaaggggccg 2640 
gacgaacggt gttacgatcg gtgcgggtgt 2700 
ccaacagtag cacaacggca tgtaactctt 2760 
tgtacagcgt acattgtttc gagaaggtat 2820 
ttcattttga tctccgtcgt ctcatactgc 2880 
tgtgcgtgtg tcgctgtaaa gtcgttgacg 294 0 
aattatggtt tattattact tccctctctg 3000 

3015 
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<210> 38 
<211> 38186 
<212> DNA 
<213> Unknown 

<220> 

<223> Description of Unknown Organism: Unknown 
<400> 38 

gatccttcct gcctcttccg gcgtctgggg ctccaggcgc ccttggcttg cagattgatc 60 
tccctgatct ctgcctccat ctgctcacag ccttctcccc tgtgtgtctc tgtctcttct 120 
tgtaaattca tccgtcgttg gatcagggcc cacccggttc ctcgtggcct cgccttaact 180 
gggccatgtc tgcagagacc ctatttccac ataaggtcct attcacagga accgggggtc 240 
aggatgtcag cctgtctttc tgggagatgt agttcaaccc acaacacaca tcaaacagtt 300 
attgagcgcc gactgcgtgc cctgccgtgt gcttgaaggt cccaccctca ggaagcgggg 360 
cctagggatg gcggccgtga tcacgcaggc agcagagagc agctctggga agcggggagg 420 
gacgaggacg gggaggcgac atcagcaagg ccgtgtgtga gccaggcagg gtgtccccgg 480 
tgtagcacct ggctcgggca gaggccccga ggaggggctg gaggagctgg gcgaggaggc 540 
gggcaggacg ggcctgacac tagggacctc gggccccggg aatgcctctg ggggggcgtg 600 
tacacccgtt gctcccagga ggcacacact gcggttcgct tcgccaagaa tgtttaattg 660 
catttgatga ctacggtttc cattcattca tttgtagaga tataacactc agaccacaaa 720 
atgcataaaa tgcggtggct tttagtatta acagagtgct gcacccgata ccacagcctc 780 
actccagaac attctcatgg gcccaaaagg agacctgggg tgttagtcac cagctcactc 840 
cccgtcccca gcccctggca acccacgcta cttagtcatt atttaggtgt ttaggagttg 900 
caaagtcaaa tctttaaacc cacatatggc caggcgtggt ggctcacgcc tgtaatccca 960 
gcactttcag aggccgagac gggcagatca cctgaggtca ggagttcgag accagcctgg 1020 
ccaacatggt gaagccccgt ctccactaaa aatacaaaat tagccgggcg tggtggtggg 1080 
cgcctgtaat cccagctact ctggaggctg agacaggaga atcgcttgaa cccaggaggc 1140 
ggcggttgca gtgagccgag attgtgccac tgcactccag cctggacaac agagcgagac 1200 
tccgtctcaa aaaaaaaaaa agtaccaaaa agtgccccag gtcataaggg cacagctcga 1260 



WO 99/37660 



-154- 



PCT/US99/01313 



tagctggtcc ctaaagggaa cgtggtgtaa 
tgccgtcctt agaagctgcc tttgctaagg 
tcatctccag acccttggtt ttcatgttga 
aatgtagctt tccactggtg tgtgtgtgtg 
atggagtctc gctctgtcac ccgggctcca 
cctcctcctc ccgggttcaa gagactcgtg 
gcacccgcca ccacacccag ctaatttttt 
gttggccagg ctggtctcga actcctgaca 
agtgctggga ttacaggtgt gagccaccac 
tttttttttt ttgagtcagg gtcttgttct 
atcacggctc actgcagcct cgacttcctg 
ccggtagaga cgggggtctt accgtgttgc 
cgatcctccc gcctcggcct ccgcaaatgc 
gcctcctttc caccgctctt gtctacagcc 
gtcgccaata cggtgtcaaa cggcgaaggg 
tggcggcagc agctcggttc cggctacggg 
gggtccgaga ggtgagccgg tcccagaggc 
cagcggaccc agccctgcct cccccctgca 
ggtggcctcc atgccgctgt tcgccaacgc 
caagctcaag ttcgaggtct tccagccggg 
gaagaagatg tacttcatcc agcacggcgt 
gatgaagctg tccgatggct cctacttcgg 
gggagggggc acgcgacccc cgcggtgtgc 
ccggggatgg ggatggggat ggggatgggg 
ccggggatgg ggatggggat ggggccgggg 
atggggccgg ggatggggcc ggggccggca 



ccaccacaca gaacgaagct ggaacgttcc 1320 
ggaattgccc tgacttccca caccattgat 1380 
tttttcaaaa atcacctgat agtctgaccg 1440 
tgtgtgtgtg tgtgtgtgtg tgagagagag 1500 
gtgcagttgt gtgatcttgg ttcactgtaa 1560 
cctcagcctc ccgagtagct gggattacag 1620 
gtatttttag tagagatggg gtttcaccat 1680 
tcaggcgatc cacccacctt ggcctcccag 1740 
gcccggcctt atttttcccc cattttcttt 1800 
gcgctcaggc tggagggcag tggtgtgggg 1860 
caccaccacg cctggctgtt tttttttttt 1920 
ccaggctggt ctagaactcc tgggctcaag 1980 
tgagatcaca cgcgtgagcc cccgcacccg 2040 
gcccctcctg gtccgattgt attggcagat 2100 
gcactgagcg ttttttcttt ctcccgtcct 2160 
gctgagcccg tctctcagac gaggaaactg 2220 
agggcgaggg ggaagcggga gtggggtccg 2280 
ggagatcgtc aacttcaact gccggaagct 2340 
cgaccccaac ttcgtcacgg ccatgctgac 2400 
tgactacatc atccgcgaag gcaccatcgg 2460 
ggtcagcgtg ctcactaagg gcaacaagga 2520 
gggtgagctt gaggggggcg cgcctggagg 2580 
agagccaggg ggccggggcc ggggccgggg 2640 
ccggggatgg ggatggggat ggggatgggg 2700 
atggggatgg ggccggggat ggggccgggg 2760 
ccagggagag cctgggtggg aagcgcccac 2820 
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gctggccaag gtgcagaggc cgggccgtgt 
tcgtccagca acccccccct gcgcgccacg 
ccgcacggcg agcgtgcggg ctgacaccta 
cttcaacgag gtgctggagg agtaccccat 
cgaccgcctg gaccgcatcg gtgagcgggc 
gggggagggg cgtggccaag gcatcaggag 
gtggctgtgg catcaggggc acggttgggg 
gccatggcag caggggcgtg gctggggcag 
ttgggtctag aggctgattt tctgacctat 
cccaagggag ggaatgcaca gggtgtttgc 
tgtgtgctgg gggtgcaggg gacagacccg 
tggactggtg atgaaatcac acacgactgg 
tgggcttccc tgagttggga atgcagagtg 
cggggaggaa gatgtgttcg tggagtggac 
agaggcctag agtccagaga gccagggagg 
cagacacaca gggcccagtg ggcggcaggg 
agccatggag ggttctaggt gggcggagga 
cgtgtccagg agggagacgg tgacctggcc 
cggcagagcc ctgatgactg gatgtaggcg 
gtgggaatgg gaatgaagac catggctgaa 
gggagctccc tggtgtacag taggaagctc 
aacccgttcc tgagtatcag gtgcttaggt 
acaggctgtg tacaggcagg tgttttcggt 
cataatcagc gtatacaggt ggtacacatg 
tgtacacagc aggcattaaa tacctgttta 
cccagcactt tcggaggcca aggtgggtgg 



-155- 

gcctgggcgg ggagggccgc ggcgcccgcc 2880 
tgcagagatc tgcctgctca cccggggccg 2940 
ctgccgcctc tattcgctga gcgtggacaa 3000 
gatgcggcgc gccttcgaga cggtggccat 3060 
cgggggcgtg gccggggcgg gtgccctggc 3120 
agtggcttgg acagtggcag ggggaagggc 3180 
cagagacgtg gccaaggcat caggagtgtg 3240 
gggcagcggc tggccgctcc taggacccct 3300 
tgtcctactt cagccagagg cagcctgttt 3360 
ggttgtgccg aatgctcggt gagcacctgc 3420 
ggggcccact cagactccca gggaggctta 3480 
gctgtgtgcc agcagggcag gtggggccgg 3540 
gagaccaggg taagggatgc catgtggaaa 3600 
acagcacatc ccaaggccct gaggtggaaa 3660 
cctggaggag gttggggaag aaggggaggc 3720 
agagtttaga ctaaatcagg agcatcaggg 3780 
cctggtcaga ttgtatccgc caaggcgggc 3840 
tctcaggggg gcagtctctg gggcagggag 3900 
ccagagagat ggcggctcat gctgctgttc 3960 
acgcaggaca ggtgcgacgg agtggtgtca 4020 
tccacaactt gctctataca gtgagtatgc 4080 
tataacttct gtatacagca ggtgctcagc 4140 
atgcctgtgg cacactggag gcagtcatta 4200 
catacttggt gcacagtgat acctgctcca 4260 
ctgccaggcg cggtggctca cgcctgtagt 4320 
atcacgaggt caggagattg agaccatcct 4380 
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ggctaacatg gtgaaacccc gtctctacta aaaaaaaaat acaaaaaatt agccgggtgt 444 0 
ggtggcgggc gcctgtagtc ccagctactc gggaggatga ggcaggagaa tggtgtgaac 4500 
ccgggaggtg gaccttgcag tgggccgaga tcgcgccact gcactccagc ccgggcgaca 4560 
gagcaagact ccgtctcaga aacaaagcaa aacaaaagcc ctgctttctg tatgcaggtg 4620 
cttcatgcat gctggctgtg catagcaggt gctcagcctg tatatggcag gtactcaata 4 680 
tccatactat aggccagaga tgctacatat gtgcttattg tatacagtag gtggtaaatg 4740 
catgcttgct ctacacggca agcactgtgt gcgcacccgc ggtgcagagt aggtgctcgg 4 800 
tgcccgctgt acgcagcagg cgctccctgt gcacacgcta acgccccctc tcccgcaggc 4860 
aagaagaatt ccatcctcct gcacaaggtg cagcatgacc tcaactcggg cgtattcaac 4 920 
aaccaggaga acgccatcat ccaggagatc gtcaagtacg accgcgagat ggtgcagcag 4 980 
gccgagctgg gtcagcgcgt gggcctcttc ccgccgccgc cgccgccgcc gcaggtcacc 5040 
tcggccatcg ccacgctgca gcaggcggcg gccatgagct tctgcccgca ggtggcgcgg 5100 
ccgctcgtgg ggccgctggc gctcggctcg ccgcgcctcg tgcgccgccc gcccccgggg 5160 
cccgcacctg ccgccgcctc acccgggccc ccgccccccg ccagcccccc gggcgcgccc 5220 
gccagccccc gggcaccgcg gacctcgccc tacggcggcc tgcccgccgc cccccttgct 5280 
gggcccgccc tgcccgcgcg ccgcctgagc cgcgcgtcgc gcccactgtc cgcctcgcag 5340 
ccctcgctgc ctcacggcgc ccccggcccc gcggcctcca cacgcccggc cagcagctcc 5400 
acaccgcgct tggggcccac gcccgctgcc cgggccgccg cgcccagccc ggaccgcagg 54 60 
gactcggcct cacccggcgc cgccggcggc ctggaccccc aggactccgc gcgctcgcgc 5520 
ctctcgtcca acttgtgacc ctcgccgacc gccccgcggg cccaggcggg ccgggggcgg 5580 
ggccgtcatc cagaccaaag ccatgccatt gcgctgcccc ggccgccagt ccgcccagaa 564 0 
gccatagacg agacgtaggt agccgtagtt ggacggacgg gcagggccgg cggggcagcc 5700 
ccctccgcgc ccccggccgt cccccctcat cgccccgcgc ccacccccat cgcccctgcc 57 60 
cccggcggcg gcctcgcgtg cgagggggct cccttcacct cggtgcctca gttcccccag 5820 
ctgtaagaca gggacggggc ggcccagtgg ctgagaggag ccggctgtgg agccccgccc 5880 
gccccccacc ctctaggtgg cccccgtccg aggaggatcg ttttctaagt gcaatacttg 5940 
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gcccgccggc ttcccgctgc ccccatcgcg 
gcgcgcgtcc cccggtgacc tcggggagca 
agaggcaggc ctggctgcgc agggcgcggg 
atgtactgac gagccgaggc agcagtgccc 
ccacaccccc attccgcgca ataaacgaca 
tgcccgagac ccgcagggcg tgcacccttc 
agagctttat ttacacactg acaaggctca 
gcttcacctg ctccaggtcg aaggcccctg 
cgggaaagcc ccgccctggc ccgcagttcg 
acctggcttg ggcaccgcct gcagtgtctc 
gggcctgggg ttggaagcag ggtggggtga 
aatccaggta gttggggtca gggagcgcct 
ctggacaggt cctgcaggat gggctcgctg 
tgggcaggag tcagaggatc cccaggggtg 
caacgcccca gtgtgggggc cccacccatg 
ctggttcatg acggagacat cagctgcgtg 
ggtcaggccc ttcctgtggc agagcggagg 
acccagtgac cagcatcctg gccctgcgct 
ccaccctcct gcaggcctcg ccccacccct 
aggtatcagt acagggggag gaaatgttcc 
ccccacgctg ctgtgggagc ctcagctccg 
gccaccaccc cgcatcctga gcattcccag 
acctccacgt gctccagatc taaccacaca 
gggcagcgcc catgctcggc tctccctgtc 
ctacattcgg ggcacacagc ctcagggcct 
actctgccca gtccctgccc ccagctagac 
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ctcacgcaat aaccggcccg gcccccgtcc 6000 
gcaccccgcc tccctccagc actggcaccg 6060 
ggggaggctg gggtcccgcc gccgtgatga 6120 
ccaccgtggc cccccacgcc ccattaaccc 6180 
gcattggcgc caagcctggc cgcgtgtgat 6240 
ctgaagacag tggctcctgg gggtggcaaa 6300 
cggggtgtca gctgaagaag taggtggaac 6360 
cggaggaagc agagcggacg gcgtgggtgg 6420 
agccaccctt gcgaggctgc ccacccgcct 64 80 
cttcagctgg ctggcctcca agatcttctg 6540 
ggctgaggcc aggttttggg gtgggggggg 6600 
tactcagagc agaaccgctt gaccaggaat 6660 
tgcaagcgga caaactgctc ccggcacacc 6720 
atcaggcagg ctctgggcac cacccctacc 6780 
ggtggactga ggctcagact acgggggcac 6840 
agtccagtaa cagtcgtgca cagagacgaa 6900 
actcctgaag ggaggggagc tcacagggcc 6960 
cagccccctc cacttgaggt ccagggaagc 7020 
cgccccgccc ctccccaaac atcctgggtt 7080 
cagaagcctc ctcgccccac ccctgccgcc 7140 
agggcggcta cgaggtcccc tcctgccagg 7200 
ctcccgtggc cggtagattc tgctggaacg 7260 
tcgcggtgcc aagaaatgcc cagcaggaag 7320 
gggccacagg aggggagctg ccaggaccac 7380 
ctacacaggc cccacagaca cagcagatcc 7440 
ccagccttgc cagctgtgcc ctgctagcca 7500 



WO 99/37660 PCT/US99/01313 

-158- 

gaagacgccc ctgggaggcg agcggcaccc acgccgtccg gagacgccca cctgtagcag 7560 
tgcagggcgg tgagcatcat gtgggaggag tccagcgagt ggatgaagtt gggcgggaag 7620 
ccgttcttct gcttacgtgt gttgggcttt ctgaggacgg aacaggtgcc ggtgggggcg 7680 
gcccagggac acccctaact ggccgctgtc tccaccgtgg ctgctctcca gacccccggc 7740 
caggccccag cccgggcccc ccactcaccg gctgatgtct ccgttgtggg tgtaggtgat 7800 
gctctgaatt ccacctccta tttgctaaaa aggggaaggg gccggtgagt cccacccgag 7860 
gcccagcacg gtggtggtac attgaggtgg tggtacactg gggtagtggc acgctaggat 7920 
ggtggcacac tggcgcggga tggggtggca cactggggcg gtggtacact ggggtggtgg 7980 
tacgctgggg cactggtaca ctgggacgct gttacactgg gatggtggca cactggggag 8040 
ggatggggtg gtacactgac cttgaccttg gagtccaggc gatagggctg gatgacgggg 8100 
acgcccaggg gtgtgaccca ctccaccaca gagcccatgt gggagatgag gcgggcactc 8160 
tcggtcagcc agtgctgtgg gacacaggcc gtctcagggc agggggctca ggccggggat 8220 
cccgtccact tgcttaggga gtcctggccg agcggggaca ggacaggacg tacctggatg 8280 
gcccgggtcc ccgagaacat ctcctgtaga ctcttgaaga cctggcgtac gagatagtga 8340 
gaggcctccc acacgaactc ctgcagaggg cgggcagcag gtgcaggtcc tcaggggctg 8400 
gcccgttcac gccctactcc cccctatttc agagccactg aggcccaagg cctagggcct 84 60 
agcagggggg caggggaatg gggcctggcg cccacgcagt cagcaagaaa cgcccaagcc 8520 
ctaacaggca gccagtggtc tgggggagca gccagggctc ctgctgggag gctgggtcgg 8580 
gggcacaccc gtctgagttt taaatggcag tgaaaccaac gtgttcgcag cgcgacatgc 8640 
ctggcgcacc tggggaaagt cgctcagctc ccggaggcgc ttctcaatct gcaggcgccc 8700 
gccatagcgc gtgaccccgt acaccaccgt catcaccgtc tgctt caeca ecttgegggt 8760 
gatgaaacct tccagcacct gtgccacccg catgccccgc tgggegtect gectaeggaa 8820 
cacctccacc tgeaeggegg gtgggccggg ggcgcgggtc agccccgcta gcagcccagg 8880 
ggccaccaag cacccatgaa gcccccgccc cagccccacc acatcctcag gaeaggecaa 8940 
ggtgagggca cctggggccg agactcaggg ctcacattgc ccccacgccg agatgccccc 9000 
gggcagcagg gcacacccta cctgcgcggc cacgccgctg tacacgtcct gcggcacatc 9060 
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cgagggctcc aggttgacgg aggcggcgcc 
ctgcaggccg ttgcaagagc cgtcctgagg 
agaggcagag acgtgtggga ccccaaacca 
gcacccccca gccaagtgca ccggagcccc 
ggctcccgca cactctagga ccacctccag 
accctcaagt cgagccccag gcccagtgca 
tggctcacct gatggacggg gaggtgggag 
gcgttcgcca cctccataca gcaggccagc 
cactttcggc cctgcgggga cagcggatgg 
gcccgctggg aggctgtgtt gcggggaggt 
tagtgaacac gggacgcatg tgggcgagag 
gccacggatg gaggatggga gctgcgggtg 
gccgggggag gagggtggac agaggacctg 
atgcagggga ggggggaacg tggggaacgc 
gcaggacggg cagggggcgc gggggaggag 
£999993999 gaggaggaag acgggcaggg 
gacgggcagg gggcgcgggg gcgccggggg 
gggtgccggg agggcgggga atgcgggggc 
gagtccagga tgtcatccat cacctcctcc 
cgcttcttca accccgtgag attgaccagg 
ccgagcgggc ggccctgggc gaactccagc 
ttgaagtgcg gcgggcaggg gtaggtgcgg 
aagacgcggt cccgcaggtg ctgcgccagc 
ctgtgcatct cccgggccac cttctggcag 
cgggcgggcg cggcgctgtg cggcaggtgg 
ggggccggca cgcctagctg ggggcagccc 



cacgctgtcg cggcccagag cagcataatg 9120 
aaggggcggc aaacgggaga tggaagctag 9180 
ccccccaggt cgagccgttc ctagggccgt 9240 
cgcacgctcc cgggagagac caggagccat 9300 
agaataccac gagcgaaggt gaaatctcac 9360 
cactgcacgg cctcgggggc cagacccagc 9420 
acataggcgg cagggtcgga ggcgcgcaca 9480 
gtctgccagg gttcctccgc gcccatccac 9540 
ggggcagtga ggcccgggcc cgatccctga 9600 
gggaaatggg gaggagacgc acacccgtga 9660 
acggggcggt ggctggatga gttctccata 9720 
gaccgggctg aaacaagcgt gtccggagct 9780 
ggggcgccgg gggaggaagc agctcggcgg 9840 
gggggccctg gggcagggga gaagggagaa 9900 
agcgggcggg ggacgtgggg gcgccagggg 9960 
ggcgccaggg gagggggagg ggaggaggaa 10020 
agggcgcggg ggcgccgggg gagggcgcgg 10080 
cccgccccta ccgtcaaggg ttggtccgcg 10140 
gcaaaggcca ggcgcttccg cagcggctcc 10200 
tggatcttga gccaatccag gccgtgcggg 10260 
agggcccgcg ccacgtcgct gcccaggtgg 10320 
ccgcggaagt ccatgttgtg cggcagccag 10380 
gagaggcggt acagcgcctc cgcccgcagg 10440 
tgcgccagct cacggcgcag ctcggccttg 10500 
gcctcgggcg gctggggcgc ctcggagggc 10560 
ttggcctgga agagctgcag caccaggtcc 10620 
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agcacgcgcc cgttgacgcg ccaggcgcag 
ccatgcagcg cggtgggcgg gcaggtttcc 
accgtgcgca tcagcttggt ggggctgagc 
ggcagcgggg ggcaaagcat gggtacatcc 
gccttctcca gcagctgcac gtaggccggg 
acaggcagac gggtcagggc cccggtgctg 
ggtgagaggg gccggctccc cggccaacaa 
ccacctccag aaacggccgg acacctgcat 
aggtactgac ggctgcgctg agatcaaggc 
cctggtcctc ccagcgggat gccccccagc 
cgttgcggaa ggaatacacg tggtagagca 
tgtccaggct gcatggcatc tgcgtagcct 
gctccatctg cactggcagg ggccagggct 
cctcccagta ctgccgcggc aggcagggct 
gctgggcacc ggggcccctg agctagatgc 
catctgtcag cccaagcata cagatgaaca 
ccagggaggg agagcgccca cccaccactg 
ctcacctcgg cgtcggaggc cagcaagcag 
tgcacctggc cactgacccg ctgcctctgc 
agctcccggg ccagggtggt gaaggactca 
ggaaggcagc cgtgagtgcc tgcccgcccc 
gcacctgcag gagcatccgc accacctcgc 
gtgagaaccg gccctcgtac acctcgcgct 
ggcacagtgc tttctcccat tggtcccgca 
gaacgggccc ggtgagcccc gtggcagctg 
ccgcagctcc ctgcagagac ctcatggccc 



ttgcccagtt gggtgagggc gtccagtgcg 10680 
agcagctcct ggtgctgcgt ggcgccttcc 10740 
aggaaagcac cagagtgcgg cgatgtccag 10800 
accgcctcga aggtcagcgt gggctccgcg 10860 
tgcggcttca ggatgccgat ctggggtgcg 10920 
gggctttcct gttcccaccc cttaaacttg 10980 
gaaaccagtg tggcctccca cgaacagaag 11040 
ggacacccat ggtgtgtccc gagtcctggg 11100 
tccgcccaaa ggcgccaacc ccatggggtc 11160 
tcaggagggc actgcctggc acctgctgga 11220 
cggggacaag ccgagaggaa cgatgcggct 11280 
gcaccagcat ctccgccagc agcttgccca 11340 
gctcccgcag ggcctcgggc gcccccagct 11400 
cgggcacctg taggacaggg cggtcagggc 114 60 
cccaccgccc gtgcctgacg cccggtgggg 11520 
gactgaagct tgggtgcaaa cccggctgct 11580 
gccccagcca ggaggagagg gggtgcgagc 11640 
aggtacttcc tgtagtggtt ctgcagcgcc 11700 
accacgtgcc ggctgaaagt gcgcgcactc 11760 
ccttgggcgg gcagcgcctg caggacctgc 11820 
gcccggggac ccggccgcgc ggaggaagac 11880 
gctcgtccag caggcacagg aaggggtaaa 11940 
ctaggcggtt cttggtctcc cgcagcgccc 12000 
gggtcttcag ggtcttccgc tgcgggggat 12060 
gtgggaccca ggctcacagg acgggggtca 12120 
tcaaggtccc tgctgtgtgt tccgggtagc 12180 
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tcctcacccc ggcctgccct ctgccggctt 
cccagctgca gtgtgcgcag aagcacaggc 
tccctgagcg gcagccaggc tgagttccta 
aattcaaccg catacacgga gctcagcccc 
ctgcggcgga cacgggaccc cggctcagcc 
ccctgcggcg gacacgggac cccggctcag 
gcaacgtggg cttctccacg gacaccacgc 
gcttctcaaa gaggcactgc agggtcttca 
gcccatcctg cagggatggg ggtagtgagg 
cccaggaccc cgagacagca tgggtgcacg 
gctatcgctg acgcggggaa aggcgggctg 
aaaggccttg accctcctgg cctcgacccc 
ggtggcaggg gcggtgatgc tgggagctgg 
ccagatgttg gctgttttct cctcaaactc 
tagatgaatt aaaaataaca actaaagcct 
cataaaagct tgtacacaaa cgttcacagc 
gcccaacgtc ccataaacgg acgaacagac 
catgactcag ccctgaccca ggtcgcctcc 
gggagatgcc aaacacaaaa ggtctcgcag 
agcagactca tccacagatg gggaggggat 
cttttagggt gatggaacat tctagaatta 
ttacaccttt aagagggttt tatggcaggt 
tgcgtgcacc tggcagcccc actcaaacgc 
gcggcccccg accacctcgt ccccctgagc 
tctgggtcct cccatcttct cagaggagga 
gccctgggca ccggggaggc ccattggtct 
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cagcgtgcct gacgcagcca agagcaaaag 12240 
caagacccaa cctcgggacc ccacaagttt 12300 
ggccctgcat gaccagacca gggcatgagc 12360 
tgcggcggac acgcgacccc ggctcagccc 12420 
cgtgcggtgg acacgcgacc ccggctcagc 12480 
cccctaccgc gtgcttgacc tccttgcttg 12540 
acaccctgct ggccagctcc atgtggagct 12600 
agggcaggtg cagcttcggg taggacacac 12660 
ttgggggctt gccagagggc gacctgccct 12720 
cgtttctgcg tctcctgcaa gttgctggtg 12780 
cgggtaaagt cagtgccagc agtgcaaacc 12840 
tctagaaggg acactgggca ccgtgcaggg 12900 
cagagcctgg ggagaccgtt cactgcaccc 12960 
agaactgtat gaatgtgacc catccagaaa 13020 
agcgctttga gaatcaaaga cgcacgtcca 13080 
tgcatgactc gcagtcgata agtagaaaca 13140 
gggcacggcg cggccatcca cgcaccggag 13200 
cggaggcacc atgaggacgt cacgctcagt 13260 
tgtgtggtcc catttctatg gaatgtccag 13320 
ggggagtgac ggggatgggg acgaggcttc 13380 
gacaaccgtg actacactaa aatcgctgaa 13440 
gaattacacc tcagtaacag acgagcccac 13500 
actgctctcc tgtcacccca ccctctctct 13560 
ccacaccctc agggccaaga ccctcccagc 13620 
agggaggaat tcagggccca gcccaggtga 13680 
gagctgaggc tccaggaacc cccaaagggc 13740 
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agctataagg actgaagtct gccggggccc 
gagcagcttg gaggtgttga ccgggggcgg 
caccttgtgc acggccttca gaacagtggc 
gaagagtgcc tgcagcttca gcccctcctg 
gcggcctgct cgagggacgg gccagcccca 
tcgtgaccac ctcctgctag cctgcaggtc 
cccagccccc agcccaggca ccgtcccaga 
ggtgggggcc ctcccgacac cacctacctt 
cccatgcact ggagggcagc cgcataggac 
accatgaata acacatatac cagctccttg 
gggtctgggg ggatggccca acctccacat 
tagccaccag ctcagcaggg gacagggtca 
atgcatcccc ctgaggttct gacacacaag 
ccaggtggga ctgtgggagg ttcacgttcc 
ttgtcaattg tgtggatctt ttcaaaggac 
tgccttccat ttcatcgatt tgttttaatc 
ggtttagtga cagcttcttc ttctagtttc 
gatgtttcac tttttttccc cccaagatgg 
agtggcacaa tctcagctgg gccgggttct 
ctcagcctcc tgagtagctg ggattacagg 
attcttagca gatacggggt ttcaccatgc 
gtgatccgcc agcctcagcc tcccaaagtg 
ggccatcacc tttccgaata taggcatttt 
tcagctgcat cccaggactt ctgacaggtg 
ttcgaacttc atagtttact tcttctttgg 
gtctcgctct gtcgcccagg ctggagtgca 
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acgtgggctc accttggcat acacgtccct 13800 
cagctgcggc gggaggctga aggtgggctt 13860 
ccgatcctcc tcagacagca gaacggcggt 13920 
gctcatctgt tccagacacc tgtggtgcag 13980 
cgctgggctt ccacagaccc caggggaacc 14040 
tcggtgtggc tgtcaggccc tctgggggtc 14100 
tcttaaaacc ctgggaggga catggtgggg 14160 
tcgatggtcc cggcgtcctg gtcctgcctc 14220 
agcaggtccg gagtcaagcc ggcatccttc 14280 
aaggcaccct gggagaccaa gccagggtga 14 340 
cctccctgct ccctggagac cccttctctg 14400 
ccaggcagga gtggccagct gggcagaccg 14460 
ctccacctgc agaggcagcc gcatggcccg 14520 
tctgggaggc agcttgttaa acctccagat 14580 
tgacttggct tgactgttct ctgctgtttc 1464 0 
tttgtaactt cctctcatct acttgcttta 14700 
ctaaggtgaa aggtgacgta tttggtctga 14760 
agtcttgctc tgttgcccag gctggagtgc 14820 
ctgcctccca ggttccagca cttctcctgc 14880 
cacacgccac cacaccagct aattttttgt 14 940 
tggccaggct ggtctcgaac tcctgacatc 15000 
ctgggatgac aggtgtgcac caccgcgccc 15060 
gtgactataa attaccctgc gagcactgtg 15120 
gtgttttcat tttcattatc tccaagtgtt 15180 
aaattttatt taattatttt tttagataga 15240 
gtggcgcaat ctcagctcac tgtcaacctc 15300 
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cgcctcccgg gttcaaccga ttctcctgcc 
acatgccacc acacccagct aattattttg 
ttggccaggc tggtctccaa ctcctgacct 
gtgctgggat tacaggtgtg agccaccacg 
caattttttt ttttttcatt taatttttgg 
agcactttgg gaggccaaga caggaggatc 
caacatagtg aaaccccatc tccactaaaa 
gcgtggtggc tcatgcctgt gctccctcca 
tgggcgtggt ggcacatgcc tgtaatctca 
ttgaacgcag aaagcggaaa ttgcggtaag 
gagacagaaa ctttgctgtc gacagacttg 
acatatatat atatatataa aataacatat 
cctgacatcc ctgttctgag caatttctcc 
gtaactgtaa cagactattc caaagggaaa 
tcctgcagct gaagacatgc tgtgacatga 
agttttgggg catttttttt tttttttttt 
gctggagtgc agtggctcga tctcggctca 
tcctcctgcc tcagcctccc aagtagctgg 
aattttttgt atatttttag tagagacggg 
ctcctgacct cgtgatccgc ccgcctcagc 
ccactgtgcc cggccggttt tggggcagtt 
tggccacaca tagtccttaa attgaaatat 
ttcagtagca cacacagctg ttggcagatg 
actggatggt gctggttaga atacgtggcc 
cacttaggga ggctgaggcg ggcggattac 
caacatggca aaagcccgtc tctactaaaa 



tcagcctcct gagtagctgg gactacaggc 15360 
tatttttagt agagatgggg tttcgccctg 15420 
caggggatcc acccgcctcg gcctcccaaa 15480 
cccagccatg tatagcttaa atatcccctg 15540 
ccaggcacag tggctcatgc ctgtaacccc 15600 
acaaggtcag gagtttaaga ccagcctggc 15660 
atacaaaaaa aaaaaaaaaa aattagctgg 15720 
ctaaaaatac aaaaaaaaaa aaaaattagc 15780 
gctactggga gcctggggca ggagaatcac 15840 
ccgggatctc accactgcac tccagcctgg 15900 
gagactctgt cttaaaatat acacacacac 15960 
atatataatt tttttcttgt attcattttt 16020 
tttgacccag tggctgctta agagtggcct 16080 
aaaattccct tacatcctcc caccccatag 16140 
ggtggccaca caccagagac cagagacatg 16200 
tttgagacgg agtctcgctc tgtcgcccag 16260 
ctgcaagctc tgcctcccag gttcactcca 16320 
gactgcaggc gcccgccacc acacccggct 16380 
gtttcactgt gttagccagg atggtctcat 164 40 
ctcccaaagt gctgggatta caggcgtgag 16500 
tctaaacaac ctctgtatgg tagacctcac 16560 
tcagttcttc cctttcacca gcttcaagtg 16620 
cggaaaattc ccaacatcat agaaagttct 16680 
gggcgcggtc gctcacgcct gtaatcccag 1674 0 
ctgaggtcag gagtttgaga ccagcccggc 16800 
atacaaaaat tggccgggcg tggtggtgag 16860 
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tccctgtaat cccagccact caggaggctg cggcagggag aattattgaa cccaggaggc 16920 
ggaggctgta gtgagccgag atcatggcac tgcaccctag cctgggcaac agacagagag 16980 
tctatctcaa aaaaaaaaaa aaaaaaaaga tagaagcaat gccttagcct ggctaacatg 17040 
ctgaaacccc acctctacta aaaataaaaa ttaaaacaat tatccggggg tggtggcaca 17100 
cgcctgtaat cccagctgct cgggaggctg agctcgcagt ccagcgacat ccaggactgc 17160 
tggccacccc ggaacgctgg gagaggcagg aggggcccct gctagagcct ctggagagac 17220 
ttcgggtctg cagacatctt gattccagac ttctgggctc gtgctaagag tgcgtttctg 17280 
ctgtgcaagc cgccaggttt gggacacttt cgtaggggcc gatcccaaaa gcgccctgtt 17340 
acagtgtggg ctctctgccc agggaatcca gggggcttgt gaccttggag gggaaaatac 17400 
acgaccctca tcctcagtcc tcccggagtc tggcgccccc tgcagcaagg aggaaccagg 17460 
cagcacgccg cctccacctc gcggtaagag cactgcggac ttcaccgcaa gactggcccc 17520 
acctgatcct gaatttcgct gtttgatgcg ttaataaaga agcacatcaa gttctctacc 17580 
acgaattggt cttaatattg cgatatctgt attttaatat aatagtatcc catgtttacc 17640 
caaatattaa gagaagcttt tactgttgtt tctcaaatta gggctgaagg atcatggggg 17700 
gggagaaagc tgggaacgtt tgctgctttg aaagggtgtg taaacaacac cctccaaaac 17760 
aaccaagagt tccgaggaga aactttggcc ggatacggtg gctcacgcct gtaatctcag 17820 
ctcctcggga ggctcagggg ggcagatcac gaggtcagga gtttgagacc agcttggcca 17880 
acacggtgaa acccccgtct ctactcaaaa tacaaaaatt aatcgggggt ggtggcgggc 17940 
acctgtaact ccagctactt aggaggctga ggcaggataa tcacttgaac ctgggaggtg 18000 
gaggtggcca tgagccgaga tcgcaccacc gcactccaac ctagtaacag ggagagtatg 18060 
tcccagaaaa caaataaata aacaaacaaa aagaaaacgg caagggaaat tggaaaatac 18120 
tccagatgaa ccacaacgaa gatgggtggg atacatctaa agctgtgctc agagggaatg 18180 
cggcgccagt gaacacccac atttcacaca gaaggatctc agcacagcag cccgaccttc 18240 
cacctcagga aaccagaaaa aggagcaaag tcaaccccaa caccaaagcc tcatcctgac 18300 
gagggctctg caggctgccc cccgacgagg ccaaaagcac ccctgcccag acagattcac 18360 
gagccccgag aaagaacgga aggaaatgct caaggcatta gcagaatttc tccctacttt 18420 
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tttggtcatt ttcaaaattt gagagtcaca 
ttattaaaat aaaaaacaaa ggacttgaac 
aatgagggca agttcctgtc tccaacgacc 
cacctgccgc gcccagccaa gcatcacggc 
cttctgccgc tggccgtggt ggacgaccag 
gagcaggcag cacttgaaga aggccaggag 
cgagtgctgc cccgacgggg cctgctccac 
cagcagccgg gccagctgct cctcccaggg 
caggcacccg gccatctgct tgctcaggag 
cgccttgaac tccccgctct ggaatggcat 
catctgctgg gtccgcttat ccttctccag 
ggtggcatcc ttggccccca tctggacctt 
gccacattct gggagccgcg ccacatccac 
agcctgcagc tgccgcaccc gcgcctggag 
gagggacccc ctccccattc gagcacccgt 
gcaggtggct gagctcgctg ggacccaagg 
gtaaggtctg cccgctgctt tccagacaca 
cgagcctcac agaactcagc agccctaaca 
tccccggtca ctggccgctc agaccctcca 
acaactccct ccgtccacct ctcagtacct 
gtggcccccg cctcccctgc ccctctcact 
aggaccctga accgcccact ccctggctca 
agcaagaccc ccccagctgg tggagctgcc 
caccggtgga cgctgagcca ctagctcgca 
caggttgcat ttttgccaac caatcaatgc 
ttgttggaaa tctattgctg gccgggcatg 
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cgtgatttgt atttgaaaag cctaaaagaa 18480 
ctgggggcta agagagaaaa gtccagtcta 18540 
agggcaggtg gcccggctcc cggctgcact 18 600 
gttgtacatg tccagcgtga gcagcttccg 18 660 
caggtggtgg gcgaggggca gctggtcagt 18720 
cctctgctgc tgacctgaga gctgggcctg 18780 
atcgaggctc agcttcccag gggcctcctg 18840 
gctctcgggg gcctggcgcg tgcagtcctc 18900 
ccggggctcc acctgcaggc gcctggtcag 18 960 
ctgcagcttc gccttcaacc gctgcatacg 19020 
tatctttgcc cagcggccac agggcaccgg 19080 
cctgggtggc tggaggctac catctccact 1914 0 
cctgttcacc accacctccg acacgctctc 19200 
cactgtgagg ggcagaaggc gaggacatga 19260 
ctctctggac cctgagccag gccaggaggt 19320 
cgtgaattcc tcatacttgc caacaacgtt 19380 
cgcaccccac cacctccgca cctccccacc 19440 
agctgccacc gaaacctgca gcaccacgtc 19500 
ggtgcacagg cccagaaccc ggggtctgtg 19560 
cctctgggct tgcctccaga atctatccag 19620 
gtctagctca gggcctctgc acagactccc 19680 
accatggcct gcaagttcgc accccgcctc 19740 
acacacacac tcctaggctc ccagtgtcta 19800 
gggaaaacgc ggctcctgct cgtgccgcct 19860 
ctaagtgttc tgtatctctt taaagaagcc 19920 
gcggctcacg tcggtcatcc cagcactttg 19980 
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ggaggccgag gcaggaagat cacctaaggt 
gtgaaacccc gtctctatta gaaatccaaa 
tagtaccagc tacttgggag gctgaggcag 
tgcagtgact caagatagcg ccattgaact 
taaaaaaaaa agactgttga aataagccgg 
actccggtgg ctgaggtgaa agaatcacct 
gtgatcaggc caccgtgctg cagcctgaga 
aagggggggg gggacccagg tgtccagatg 
taggaggccg aggcaggcgg atcacgaggt 
gtgaaacccc gtccctacta aaaatacgaa 
tagttccagc tactcgggag gttgaggcag 
tgcagtgagc caagatcgca ccattgcact 
caagaaaaaa aaaattgctg aaataaaaag 
caaaactcag gagatacttt taggattaac 
ttttgagatg gaatttccct cgttgcccag 
ccgcaacctc caccttccgg gttcaagcga 
cgattttcct gcctcagcct cctgagtagc 
gctaattttt gtatttttag tagagacagt 
actcatgacc tcttgatccg cccgcctcgg 
gccaccgcac caggcctcgg acccttgacc 
gtgctgggag tacaggcgtg agccaccgca 
gcccacctcg gccacccaaa agtgctggga 
gttttttccc tttataaagg ttctcccgcc 
acaggatgtg ggacagaagc accggcggga 
ccacaccacc agccaggcca ggacgagggc 
ccaagtttcg cagtcatcca gtctccatta 
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caggagttcg agaccagcct ggccaacatg 20040 
aaattagctg ggcgtggtgg catgtgtcta 20100 
gagaattgct tgagcctggg aggcagaggt 20160 
ccagcctggg caacagaaca ataatccatc 20220 
gtacagggcc gcgcacctgt ggtcccagct 20280 
aagcctagga gttcctggct gctgtgagcc 20340 
gacagagcag gaccctgtct caaaaaaaaa 20400 
tggtggctca cgcctgtaat cccagcactt 204 60* 
caggagatca agaccatcct ggctaacacg 20520 
aaattaaccg ggcgtggtgg tgcgcgcctg 20580 
gagaattgct tgaactcggg aggcggaggc 20640 
ccagcctagc aacagattga gaatccgtct 20700 
acaagcgtga tgtccgcctt cagagtgctc 20760 
agttgagagc tttgttttgt tttgttttgt 20820 
gctagagtgc aatggcatga tctcggctca 20880 
ttctcctgtc tcagtctccc cgggttcaag 20940 
tggcactgca ggcgttcacc accatgccca 21000 
gtttcaccat gttggccagg ctggtcttga 21060 
cctcccaaag tgctgggatt acaggcgtga 21120 
tcttgatccg cccaccttgg ccacccaaaa 21180 
ccaggcctcg aacccccgac ctcttgatcc 21240 
ttacaggcgt gagccaccgc acctggccag 21300 
tctcccttcc cggctgccta atggacgcag 21360 
agcaagcaca gggaagctcc cacctccctc 21420 
ctgccaccgc tggagcctgg gctgtccctc 21480 
ggcgcctacc ccccagagcc aagccaggac 21540 
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agctgagtca gttcagggtt cacatcctgg 
gggggggggg tctctccaga cataatcttg 
cacctggtgg ggttctgggg agggggaatt 
aaggacaggg cccatgagcg gtgcaagaga 
gggacccgag aggtggagac cgccctcagc 
tcccggcagc agggagcgga ctctcctctc 
ccagcagctc cacgtggccc cagtccttcc 
cggacgagct cctcctgggg ccgcagacgc 
cggctgagcg gggccgggcg tgtgggcggg 
gaagggcaaa ggagccctaa acgcagcgga 
gaaactaaga cagcgaaggt ggaagggccc 
gagagagggt tcctcgcact cgaggtgcag 
cacctggggt caggaggctg cataagaaac 
gcctcagtcg aaaagcgggc aagggcgagt 
acgcccaggt gcaaagaggc aaaggtcaaa 
gcccacgccg tggcccccgg gctgcctggc 
ggagtcccgg gcggccgcaa ggccgtaggg 
agcaaagtgc cgacattacg cacgccgctc 
tgcgcccgcg ccgcctgccg ggagttgtgg 
accccagccg gaagtagact cccaggatgc 
cgctctgatt ggttcctcct gctgttttta 
tggcaactgg gaggcactct caggctgttt 
ctgtttttct ttcttttttg gttttgtttt 
tggcgtgatc atagctcagt gcagcttcga 
ggcttcccaa agcgctggga ttgcagacgc 
tttcaaggca tactcatcta ataacgagga 



ctctgcacat gtggccttgg cggcggggcc 21600 
ggcctcacct atgtccctgg aaagtgggag 21660 
acgagagctc caggaaggag cctgctcagc 21720 
tgtttcagca acgccgtctg ggcgtgtcct 21780 
ctgtctcaga atctgagcct ttgccttttc 21840 
ccgggccgcc gtgggggtcg cgctcaccct 21900 
tgcggtcttg gtcttgctcc tgggggctgg 21960 
caccggcggt ccctgcggga aagacgagag 22020 
ggcctccata aaggcagaag ccgaagggtc 22080 
aactctcgga gcacgggctt aagttggaaa 22140 
cgccgcggcg aacacgggcg cggaaccgcc 22200 
caggtcaaag gttaagagcc ctaaacacca 22260 
cacgagtcaa aggtcagact gcacggagga 22320 
ggaaagcggg gccgggtcgg tgggctgcgc 22380 
gcgccaaagg ccccggccgc gcggggagga 22440 
cgtctccctt tgtgttacct tctttgccgg 22500 
ctcgtttgag ccccgccgct ccgcggcccc 22560 
caggccaccc caccggcccg cgcctgcgca 22620 
tttcatggtc gacggaggct gcgaagggaa 22680 
agcggaggcg cgaaggcatg cgccggtgga 22740 
aagggagggg gcgggacaga gctgttgccg 22800 
tcccgaggac ctcaaatccg gacttttttt 22860 
ggacgcgttg tggcccaggc tggagtgcag 22920 
actgctgggg taaagagatc ctcgcccctc 22980 
cgccaccgtg cccggctttt tttttttttt 23040 
cagcatctgc aatttagaga ttcctgtccg 23100 
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caaccttcat tgctccaacg acaacttttg ggtaagagtc attaggatgc cgtctatcat 23160 
ggaggaagct gaggctcaga gagggccacc aagttgctgg aagacacagc acgtgcgacc 23220 
tcagggaggc tgcaaggaga gaaagcccca gtccgcgaga ctcccagcct ccagcttcag 23280 
tttaccctcc aatccccaag ccctcagggg caggagccga atggagcggc aggcttggat 23340 
tcacctgcta agtggggtga ggtcaaggga atgaaataaa cctcggagcc tagagcctgc 23400 
cctggtctcc gcgtgatcct gcctaggagg agcagggcgg gagctttaga atggaacctg 234 60 
gaaggtgtgc ccacctgtgt cgttcagccg gggcagcagg ccagaggcgg gagcgcctgc 23520 
tgtggggcag taggcttggg aagggtgaga ataggaatat ctgggggtaa ctgtgttcca 23580 
ggctaatatc ccagttgcaa aggggagctg gtttggtggc tcaggcctgt catcccagca 23640 
ctttgggagg ctgaggcggg cggatcacct aaggtcagag ttcgagacca gcttggcaaa 23700 
tacgcaagca tgcctggcaa catggcaaaa ccccgtctct agtaaaaata caaaaattat 23760 
ccgggggtgg tggcgggcac ctgtaatccc agctactcgg gaggctgagg caggagaatc 23820 
gcttgaaccc gggaggcgga ggttgcagtg agccaagatc tcgccactgc actccagcct 23880 
gggtgacaga gcgagaacct gtctcaaaaa aaaaaaagtg caaagggagg tcagttcagt 23940 
gcctcaggcc tgtaatccca gcactttggg aggctgcggc gggaggatcg cttgagccca 24000 
ggagttccag acaagccttg ggcaaccgag atactgagac ccagtctcca ccaaaggaaa 24060 
aaaagaaatt agccaggcat ggtggtgcac acctgtggtc ccagatactc gggaggctga 24120 
ggcaggagga ctgcttgagc ccaggaggtt tagactgcag tgagctgaga tggcgccact 24180 
gtactccagc ctgggttgac agaacaggac cctgtctcaa aacaaaacaa gtgcaaaggc 24240 
cctgaggcag gaacaagcgt ggacagagga gcaatttgag cagagtgggg ctggggagag 24300 
ggagcaaaga tgtagctggg gctcagttag ggggcctgac cacacggggg ctcgggggcc 24360 
tcagctcaag ctatcctcca tccccaaacc ctggcacttc agtttcccca tcagcccaga 24420 
acgaggactc gacctcactc tggaagggcc tggcagcctc cttacagcac attccagacg 24480 
ctgctgccga cgcctgcgtg agcgcactga tgccaccggc tgggaatgtt ttcgacagac 24540 
ggcagcaccc tccctcacct gcctcagtcc acctcagggt gccccagcgg gctgtgacct 24 600 
cagacctcac ccactactgg ggtcacctgc ctggccctga atcagccagg cctggtgtgc 24 660 
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caagacctac agacaccccc tgcacccctg 
aaccgacttc tgaactattt caccattcct 
gatcccagca ctttgggagg ccgaggcggg 
agcctggcca acatggtgaa accccgtctc 
gtggcctgtg cctgtaatcc cagctactca 
tgagaggtgg aggttacagt gagccaagat 
tagccaaacc ccatcactac aaataataca 
cctgtggccc catctacttg ggaggctgag 
aggcagaggc tgcagtgagc tgtgattgag 
agaccctgtc tcaaaaaaaa aagaacataa 
gaaacagctc attgcccaaa ttccagcctc 
cccaccattt attttaacta catctgtctc 
tggctggtca tggtggctca tgcctgtaat 
atcccctgag gtcaggagtt cgagactggc 
taaaaattca aaaattagcc ggacgtggta 
ggaggctgag gtaagagaaa tgcttgaatc 
ctcgccgctg cactccagcc tgaacgacag 
aaatacaaca ttaattattt ttcttgctta 
aaaagtggga aaccatatat ctggccgggc 
acgggaggcc gaggcgggcg gatcccctga 
atggtgaaac cctcatctcc aataaaaata 
gtaatcccag ctattcagga ggctgaggca 
ttgcagggag ccgagatcac accactgccc 
ctaaaaacaa aacaaaacaa aacccaacca 
atggaagggg atggcccagg tccctggagc 
cttaaagcaa cgttcatttt cttacatttc 
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caggctggca gagccagaaa cttgggtgga 24720 
tatgcgttag tcttttcttt tatttgatga 24780 
cggatcacgt gaggtcagga gtttgagacc 24840 
tactaaaaat acgaaaatta gccgggcatg 24 900 
ggaggccaag ggaggaaaat cacttgaacc 24 960 
cgcaccactg cactccagcc ttgggcaatg 25020 
aaaaaatttt gttggctgtg atggtgcctg 25080 
gtgggaagat gtagaattgc ttgagccagg 25140 
ccactgcact ccagcctggg cgacagagcg 25200 
tctgggtttt ggaataagac agcagtttct 25260 
gcaactctgt agccgccacc accccccagc 25320 
caccactcct gtattaagta aatgcaatat 25380 
tccagcactt tgggaggctg aggcaggcag 25440 
ctggccaacg tggtgaaacc ctgtctccac 25500 
gtgggtggtg cctgtaatcc cagctacttg 25560 
caagagactg aggttgcagt gagctgagat 25620 
agcgagactc cgtctcaaaa ataaattaat 25680 
agttttacga agagacttaa tatcaccatc 25740 
gtggtggctc ccgcctgtca tcccagcact 25800 
ggccgggagc tggagaccag cctggctaac 25860 
acaaaaatta gccgggcatg gtgggtgcct 25920 
gaagaatcac ttgaacccgg gaggcggagg 25980 
tccggcctgg gcgacagagc gagactctgt 26040 
agcaaacccc acagagtcga gaatcgctag 26100 
ccctgtgaca aattaccaca aactcggtgc 26160 
tggaaatgaa aagtccaaaa tcaggactgc 26220 
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ggggctgaag tcaaggtgtg tggaggcctc 
tgcctctccc agcttttgaa ggctccaggt 
ctcggtctct gtggtcgcac tgcagcctcc 
tccgtattgt gaccgcgttt aggatgcccc 
tcatggtgtc aatatattga gactcttttt 
tcagggtggg gacttacctt tgggccaacc 
aatacaaagc gtgcgccagc ccagccctga 
gacagagtct cgctctgtcg cccaggctgg 
acctccgcct cctgggttgt atagattctc 
caggcgtgaa aaggagcaag gctctgcccc 
catgctcagt gaaagacgcc agacacagaa 
aaatgtccac aacaggccca tccacagagg 
ggcagagggg atgagtgaca gctgatgggg 
actagacagt cgtggtggtt gcacaactct 
gtgcagtggc tcatgcctgt aatcccagca 
ggtcaggagt ttgagaccag cctggccaac 
aaaaattagc tgggtgtggt ggcaggtgcc 
aggagaatcg cttgaaccag ggaggcagag 
ctccagtctg ggtgacagag ccagactccg 
ctgaactgta cagtgtaagt gggtgaattg 
cattaaagaa actcagacgc ctggggtggg 
atcatttctc acaaggccct cagatcaccc 
gggaagacct gggctcctgc tggcgagaag 
gcgccctctg cccagcgtgc tccgtggagg 
actggcccat tttacagagg cggaaccgaa 
tgggaagcca ggccaggctg gacctcagcc 
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gctccctcca gaggccctgg ggctccttcc 26280 
gtgcttggcc tgcggccaca tcactcccgt 26340 
tcgtctgcct gtgtgaaatc tcctcctgtc 26400 
aggacaatct tctccatatc gttcagatct 264 60 
ccaaataagg caaatgtcac attctaggga 26520 
acagaggcta caaagaggaa gacaccactc 26580 
tcggtgtttg ttgttgttgt ttttgtttga 26640 
agggcagtgg catgatctca gctcattgca 26700 
ctgcctcagc ctcctgagta gctgggatta 26760 
agccacagcg cggatgcacc ttgaggatgt 26820 
ggacacacag tgtgtgatcc cctttatatg 26880 
caggaagggg atgtgtgggt gccgggggct 26940 
cttcttcttg cggtgatgga atcttctgga 27000 
acgaggtact aaaatcactg aactggctgg 27060 
ctttgggagg cagaagcagg tagatcacga 27120 
atggtaaaac tctgtctcta ctaaaaatac 27180 
tgtaatccca gctactcagg aggctgaggc 27240 
tttgcagtga gccgagatcg caccactgca 27300 
tctcaaagaa ataataataa aataaaatca 27360 
tgtggtatat gagtgatgtt tccgaggtgt 27420 
gccagtctca ccgctgtggg tcccatcccc 27480 
ttccgcggtg gggggcggac actctaagaa 27540 
gcggtggaca tttcttcagt gtctggtgcc 27600 
gtctcattgt cttcctccag acgtctcttt 27660 
gcttggggtg ttggccacag ggctctagtg 27720 
atggggaccc ctgtccctga gactgtggca 27780 
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cctgccacac cctctgtgtg acccgcctaa 
gccaccaaga aggcatccca gcgtccagcc 
gcaggacagg ctggtcgcca tgtcttcagc 
gggctgtggc cgcagggcca ggactgggct 
gccgcccctc tcaggcctcg gtttccccag 
gtgagggtca ctgtgctcac cttttggggc 
ggggccagcc gccttgctgg gtggttcccc 
agcaagtgtg ggtgcgaatg gggccgcccc 
ccggccaccg acaggtctac gtggctatcc 
acgtccccag tcgtcacctg cccacccacc 
tttcgggaaa ctacaatttg cggggagaga 
ctggtcccac ggctgacgca cgcggcagga 
ctccgcctct gcccctctct gtctctgtgt 
tgtctcttga ctgccgccgt ctttctgtct 
aggtctctgc ggcccgcgtc tcacactccc 
ggggactcgg ctgactcacg gacacgcccc 
agcgcggagt cccgcacggc cgcgcccctg 
attggccggc gccctagcct ggtcccgccc 
ccattggccg gcgtccccgc cccagcgaac 
cccgccgcgc gaaggcagag ccgcggacgc 
tgccgggtca tgcgccgccg cctgtggctg 
ccggacgccg cgggaacccc gagcgcgtcg 
ggcgacgtgc gctggcggcg cctcttctcc 
ggcggccgcg tgcagggcac ccgctggcgc 
ggggcctggg gtggggaggc ggcgggtgac 
tgcgcccgcg ggggagtccc ggaggctcct 
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gccaggaaga gagggtcagg agatgcctga 27840 
agaccggtta tccctccaga gggctccccg 27900 
ctggtgctat ttaaaggtgg gtgccacctg 27960 
gctgggagct gtgtccccac agcggaggtc 28020 
ttgtcaatgc ctccacttgg ctgtgagtct 28080 
ccagcgcatg gggcaggcag aggaagggtg 28140 
gtggggcctg gggtatggct ctaagggagg 28200 
attcctgccg cctccgacgt gccccgccag 28260 
tccctcctgc ccacctacct gcccaaacac 28320 
cgcgcattcc cacacccttg tgggcctggc 28380 
agtcccacga gggcatgccc cggagcctgg 284 40 
cctcccgtgt ccatctctgt ccccaagcat 28500 
ctctctcgtc tctcccggtc atcttccttg 28560 
ctgtctccct ccgggtctct gtctccctcc 28620 
gcccccgcaa cccgaggtcc tagcccgccc 28 680 
gcgagacaaa caacaaacgc gcggaggccg 28740 
tgcacctggc ccccgccccc gagacgtccc 28800 
aagtggaccc cgcccccgcc ccgaggcacc 28860 
ccggccccgc ccccgaggcg ccccattggc 28 920 
ccgggagcga cgagcgcgca gcgaaccggg 28980 
ggcctggcct ggctgctgct ggcgcgggcg 29040 
cggggaccgc gcagctaccc gcacctggag 29100 
tccactcact tcttcctgcg cgtggatccc 29160 
cacggccagg acagtgagtg cggggcggcg 29220 
ggcaacgcgg ccgccgtctt cacggtgacc 29280 
ctgtgcagcc tcggcctcag tttccgtggt 29340 
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ccgggtgcg tggctcacgt cggtaatccc 

n f.acacggtg aaaccccgtc tctacc.aaa 30480 
acg.ggtcag gagatcg.g. ccatcccggc taacacg, 
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gt acacc=. g « g .caa g = «c, 9 acacc ea gg cc gggg t g a g c g a g t g t , gg t g a„a 30860 
glgg c gg c tg ,cccc, g „t c«t,ct„a C aa,acactt ea g ctca ggg t gggg c. ggg 31020 
ct caceca gg g =t.cccaca g a= g at gg = g tcc.a.tc tg g ctct g ccac t c„a gg cct 3,080 

ea»ct gg c C c ct« gg .ac g t ggg « 9 « g ™° 
WM a^ct, t ggtg tccc. taa.ccacc, gtg tccc.ct «a g9 =cact 31200 
g cca g ccc g9 cc«=. 9 tcc a 9 a g tcce ag tcc^cc aa,t g aot g t g c gtg99 ~ 9 31260 
gg » g ct g . g « gtg a ggg c c ggg «, ggg .ctccatat, ggg t gg t,t g - g ct g t g . gg 313.0 
g cc 9gg =t 99 ggg ctcct, t ggg ,t, g t g tg a g ct,t g a ggg - gg ,« ggggg ct==a 3,300 
t , Cggggtgg tgtg a g ct gt ,a ggg ac ggg ct g99g9 ctc c.tat gggg t g9tgtg . 9 ~ 31440 
gtgag99 =c g g9 ct gggggg t cc« gggg t gglg t g a g ct gtg . ggg o= g gg ct 5ggggg 31500 
t ctat ggg9 t gg , g t g a 9 ct gtgaggg cc g gg =t ggg5g o tccat.t ggg 9 t 99 t g t g . 9 31560 
c tgtg a g99 c Cggg c tgggg g ct=c«at g ggg t g9 t g t g .,^00 g cc g99 e t9g 31620 
gggg tcc=t 9 9ggtggtg t g « g ct g t g . gg g cc ggg ct gg gg .«oc= tg ggg t g9 t gtg 31680 
ag otc t9 a gg 9 ec g99 ct 9g «««««. ggg t 99 t 9 t 9 a.ct.t,-,. 31740 
ggg «c=ata t ggg ,t ggtg i g . g ct« g a 9gg =c 9g9 ct ggggg =tccc t 9gg9 t g =t 9 31.00 
rtgg tc 9 ct g 9 ctcat tg ac a g «,tca g t gg tct 99g t g 9g ccct g ccc cttct g ,«c 31868 
ccacatccca 9g aaeccttt cccaaccttc ctc g t gg t g l t 9 ct 9 ccccc =t„ac g tcc g 31820 
tc cctct ggg t,t g t ggg » g ccc=cce g ec atacacacac a=a,.t,ct g ct«t ggg ct 31,00 
g>g « g c. gg acc t , g ccct =cca„ gggt =««tc,at ct« g c« c 320.0 

cccccte, t ggggg cc,t cc tg c«ccc ^tccctct, cct g ctce« g =tc«<»ct 32100 
c,c.« g « gg ggg9gg «=c ***** cae gg «« g 32lM 
gtgg tc ggg . ggg9 ac t = g t gg tccc, ggg g tctc« gg t atct g te g t g g tc« g a ggg 32220 
ccc tg eac g a a,ca g . g c gg aca g c. g c gg t g « gggggt ^caa g9 ccctcccc 32280 
g accccc g cc tc=acc. gg c atc= t99 a 9 a toe**** acac,t ggg c M#- 32340 
t c,a ag ca 9 t g tcctc, gg c «c t ac g t g9 ~a tg aacc g cc gggg cc g c ctctac ggg < 32400 
c gg t g a gtg c c ggg c. ggg = t ggg c 9g c g c ggg ca, ggtg 
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,, c t acaccgtgga ctgcaggttc cgggagcgca tcgaagagaa 32520 
cccccgcccg cagcgactct acaccgcgg 

„.c .c«.c 9 cct =.c. 9 a 9 ct 9 9 c 9 ~ 9 c= g c gg cc. g ccc t 9 «cc tg9 c 3 258 0 

ccact «c tg ccc gt ec tgg — «~ »~ — 9t9 ° 3 " 00 

^ =c 99 = M c,c «^c«cc ccc t9 c 999 c t c t9 ». 9 e 9 3 2 3 6 0 

e^ccc ^ -*Wt- — 32820 

cag c« 9 .„ gggtgg t, g = c,CC 9 C. 9 **— t ^t gg , g . ™.C 3,000 

^ — — » 9 ' 5 " 9tOCO 329,0 

a99 a. 999 cc ^ — « —» ~ "'— 9 t999 "" 9 ' 33000 
a9 c 9g . g9g9 a ggggagggg g cc ggg3 . g . a 99 <— = 9 ca 9 c. ga . c.ea. g cc« 330*0 

gaag ac 99 ~ cc^ccc,,, — * a9 " a " t9 33120 

c. 99t9g9 .e gg a g «„. g .a— — * — >9t9tC9tM 33180 
ca999 tactc .= 9 aa. 99 a 9 9 C 999 — ^ ««~ " -«-~ 
9 c tgt99 cc 9 tga a gW a g ,.a ggg cec t .. g ct g . 999 ctca. 9 at 9 c 33300 

t9999 .a 99 . , gg aac gg « cc 9 c.« 9g . 9 ~~c 999 . t 9 ™= 33300 

. Hn . ac tctggcct ccagaaccgc cagaaaataa 33420 
ttgatatgag cccagtggga cccagtttgg actctgg 

acgl . 9t ,, g cca t c..c« t 9 «.c. 9 c, « 

^ .at 9 «ctc. t ,acaaa. t a . g cc t e, 9 .t ccccc 9999 . M***. 333,0 

ro «^. — — ' a "" 9 " 9 ° " C " t9ttC 33600 

ctcg9 . t< =ac cc g9g .. g99 c. g . g9 ~ 9 , ctcc t ca g , t ccc Mg99 .. 33SSO 

99g .. g . g9 c , gaggg c. g , ggtg » g9g c 9 c«c. t9 cc »3 2 0 

g9tgt cct 9 c ^a .,^ — C "" 9t9CC 33780 

ca ,a 9 ccctc c^.tcc* 9 .ct== 9 e C9 99 acttct 9 c =tac, g occ t 999 a 9 «aa. 333,0 
c t =ca 9 cctc t c. gagaaa a g9 «. 9 a 9 ec —c. c. g co, g9 . 9 cca 9g ca 9 t 9 33300 
a _« 999 c« 9t «c=c t 9999 c 9 aca 9 c, 9 c.cc 9 c ae t9gl9 .. 9 33300 

™ 9 a c 99 cc. 999 c t cc. t accca 9 ce 9 ca 9 ac t tae.cat.aa taca 99 a, 9 . 3,0.0 
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ct^cca, g c.«t g ct g g,,,^c tg 34080 
a ggg . ggg t= g t g cec gg cc c. g a g cttc t g «c,=cct g ca g ,c g a. g c g . g « C c S 34110 
cocoas eacc, gg c gg cctct gggg a ca t ctt tgg c t gg cat gg a, t ggg t gg . gg 3,200 
,c, ggg c tg c accent, tcc=ca gg tt gg =a g t g t g . gggg . g . t = g -J—** 3,260 
g caa g ta gg . actt g „tt g t=ac tggg a g ct g c.c gg t c.cca=. g ct 3,320 

g a.at,a.ac tt g « g5 cac a 5 c.c g oa gg aac g «acat ^cct^.a g a.ac gggt c 34380 
. g c» gg <:c gg g c 9 c ggggg o t<=.c g ect gt catccca g ca ctt tggg ., g ~ g a gg c ggg 34..0 
tgg atca= g » gg t=. gg a g . tcaa gg c=at ctt gg tcaae at gg t g aaac ccc^cta 3,500 
ctaa.aatac aaaaaatta, cc ggg c gtgg c^mmo ct g taa«cc a g ctactt g a 3,560 
g . gg c tg a gg . gggg a.t<= g ctt g .atcc g gg . gg = gg a g g « g ca g t g . g ct g . g atc g 3,620 
ogc oa« g c, ctcc. g cc t , ggCg .c. g a g c g a g .ctcc g tctc.aa.aa aaaaaaa.a. 3,600 
a. g aaac. gg tca g ca g tt g t«cttt g « tctaaaaca, a g c g t gg .at ggg o g t,ca, 3,7,0 
W c.« =oc, ggg c. g t g aa.tc« g g ,tc,=,c, g a g cc= t c. g c a g ct t a«c g 3,000 
ca , g cccaa. octree c<=c g tt g tcc t Mgg c. g t g a ggtgggg9 = cccc=aac. g 3,660 
, g a, g . g = gg cot ggggg =. c. g a.cca g c gg ct<=c C ca g g aaatc g cc. g ca g t g a,a. 3,*20 
ta. g .c.,cc cc.ctgtt g o.».« g t g cttcc g c«, c g .a„=a«c ct g , g c gg ca 3,900 
ggg c 9g at gg gg a 9 a ggg c g g ot g c. gg c g c g . gggg ccc gggg ac g c, g ggg t g c ggg c 350,0 
ctt,cca ggg ce=t g «c« g tc g t g c, g c. gg ctcct ggg g c. ggg .. g a cee, gggg c 35!00 
gg cc.c«tct tactic t g .cctc g . g c,at g <= gg cc teaca g cca= cacca ggg t g 35X60 
cc gg t g tcct ct ggg ccc. g cccccca, g ctcat g c=t ggg t gggg c g aaccaatc gg 35220 
t«t g ctcct ct gg ccact= cac,c g a ggg a. g tccc. g c ctcaca gg ca gg c g =acaec 35280 
cc g ,ca g cat ctct g ac.a, gg ccctc=. g ttcc gag t=t cc gg tcc eg e cg ct g caa, 353,0 
c<=tccet g e cca 9 ccctcc tctec. g ct= caactccaac tcccaa g .ao 35, 
acaca g a.ce c g a g cctt g « ctccctcaac g cc t ect g ao tca.actec atcttccaac 35, 
a gg a..ac gg ctc gg cc ggg gg act,t g .c cc gg a g ea gg c gg coca g cc t g tc g c g c. g 35520 
,ct= gggg cc taaaacactt g ttctctca g tce gg a g .tc a. gg ac g atc c g a gg ta.cc 35580 
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tcectacctc 9g t g tcctcc at 9 caac g tc 9t c«a 99g c acc 99g tac 9 tt,ccte 9 t g 35640 

a g9 a 9 ce g . g tcc 9 c 9g9 tc ct g9g9 « g , g .t 9 t 99 ac g ccot„ ggg c t gg caetct 9 35700 

ccc tgg c g9 c cac. g tcat g g a. g tccc. c g c««ctc gg ctcc g caa 35,60 

c gg ccac g a g g a ggg ccc g = c. Cg c.c g ac cecals a ggg cc= g cc 35820 

wg .=c cc. g a 9gg c g g «acca g9 a ggg ccc g coa c gg =,« g c g g ca g ca g ccc 35880 

a g ,a ggtg cc ct 9 e g cac gg tc. gg a C a gg t ggg a«c g a g «a g ct ggg c.a ggggg ct 35540 

g ,c g ca,aca c g tc 9 = ggg . caca g t g aa g a gtgt9g t 9 . a 9 a g c 99 a g9 9 o ggg . g tet 3,000 

9g t. 9ggg c g « 9gg ca= g = c g .a gg a g =c g t«»=c gtg 36060 

WM . C .C 99 t 9g tcct 9 ct 99 a g9 ctc ctct=c 9 «a 9 « g t«o c . tc,tct g .« 36 12 0 

ct t99 .tccc a 99 .t g9 t 99 g ateatc, g c aact g a g at g >ac c c.« g c ccc 9g =cccc 36380 

t9 . 9 occ 9 c. 99 t=ccc,c 9 cctt 9 =c, g c t 9t9 =cc g a g ct gg ct g c.c o=c 999 =o. 9 36240 

g catcc» g <=. acc« 9 » 9 c. ****** g «tttca 9 a , g9gg cc, gg aaccc g c= g t g 36300 

gctg a gg t g t ^c,.« t 99gg c. g , 9 «**™=c ct gg c g c«t a.c g « g9 t 9 36360 

ttt « g9 ttt taa.tttc.c 9 .coc, g t g . cact g ccac= =t 9 ct.cctc 9 cca g ca g == 36420 

= t cct 9g9 ct ta,cttc 9g9 a g . g ca,tt t t 9 «a 9 c= 9g oc=t g9g t g c c.a g =cct g = 36480 

, g9 a 9g c 9 c. ,.===« gg a g .c. gg .cc g g »ct C t g e, 9 a 9 ccc g a=ca 9 c c tccc. g c 36540 

ttgg cctt« ect g ,c g c.c 9gg c 9 ca g a. g9 aa» 9 c.ao a 9 c.cc 99 c t t rt =«t g ta 36600 

a^t.t tttaa»ta 9 c t «caa g ,ta eacat.tttt ttcctttaa, aaa g tc tg « 36660 

gg , g ca g ttt t 9 «ct t9 .a tttt 9 ot 99 t catcctcat, 9 tcc c9 . 9 cc cc-c=ta«=c 36320 

„„,, „aaaaoact g 99gg «cao g t gg cc cg tcc t gg c gg cacc 36380 
gggtcgtgga ggcggccgag ggggaggctg gyyy 

t9 c. 9 ca= t9 9gg9 a 9 cc 9 c t g .acccc 9 t g =t«c. g c g c t 9999g . g cc 9 ct gg9 cccc 36840 
9tctt cc 9 cc ac.a.cca tg ca tgg cc 9 ac ,c 9tg a 9 ctc aa. Cg tcc g t ttatttcaaa 36,00 
9 ca 9 taa t ,a tttaaaatta taaaaatctt t cc.cc 9 =t 9 a.c 9t tta g a 99g t g a gg « 36060 
ag ,c, g . gg . c gggg . 99 ct ***** c. g . g9gg .o -«*«~ ac g cctt=ce 33020 
a. 9 cca 9ggg g cc g9t999 c c 999 ccc g9g t cct g cect g g .ao, gg c g9 9 acct 9 ca g c 33000 
g ct 9 acca 9 c caa 9 = 9 t g9 = 9 oc g cc g999 caccca gtrt '3240 
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^ cc5ac9t " 5 ctc *° 9a9tt 31200 

g ctt g t gg c g ttctc 9 tt,c tggg = g . g <* «p-w- a99a99a9aa 31260 

g c t c.ccca. 5tW9 cca 9 , 9OT5 ttc,t gg cc gtg «= t*»«» gg ctttttc g 3,320 
g ca g ac 9ggg ca g « g tct g c« tgtggg g aca g , gg c. ggg .c ggg . g .a gggg oa gg 373,0 
«. g a gg = gg .OTOT g tc, ggg t gg MOT* — »— — 37 "° 
=ca g cc, ggg cac g .t,ea g =c g tc gCgg a aca gg t gg tt g c. ggg c. g c t g cc g cac.c 3,500 
,„ c g = g ta g t= g tcc« g cac, =a ggg c.«c g . g ~c gg . g cct g c 9gg a g 3,360 
tgtg = ag « g = gg toaca g c ggg c,t gggg gg ==t g cc g a g c«tc.a gg g c. gg ot.ct 3,620 
„a« g =c t c a g cc gg . gg c c g ccc« g a g c C c» g c g a gg gg a,aaa, g c c g t g t gtg t g 3,680 
to „cc ggg c t,cc. g a ggg g a.ct gg ac. g ..ccct«c «ccc. gC c= «cttc. ggg 3„« 
aaa tg c t c g a gg cc ggg t,c gg t, g «c, C g cct g tc.tc cca g cct« ggg » g ,cc g a 3,800 
g ,c. gg a gg a tc,cct g a gg tc. gg » g tto g , g ,<=ct g cc t g .cc,acat gg t g a,.ccc 3,860 
tgt ctctact g aa..t,c.a ,»t g a g cca gg c g t gg c gg c ggg t g =ct, ta.ttcccac 37820 
t,ctc ggg . g g ct g » g «ct catacctac, toctectca, tg .c gggg .= H t nM 3,080 
, cotgg at« tctctttatc t g cc ggtggg tttcaact, att g a gg a g e 380,0 

tg .a, g aca. g . gg = g a g .O t g co ggg . g c tcc t c ggggg «c gg ccc, g gg «c tg .a, 38300 
c g c„a gg ct g e, gg acct g c aaa. g cac= g „ g cc g c g tt t g tcc,: ggg c octree 38360 

38186 

ttggagcccg cccggggtcg gagatc 

<210> 39 
<211> 720 
<212> DNA 
<213> Unknown 

<220> 

<223> Description ol Unknown o.: g anism:Unknown 

^Sct t g acc tg .c t «ca tg aato g .aaa gg aaa tcct=tat g a ae g cact g ca 60 
tc g c.tc gg c g cc gg aac g o tact gg ec g t g « g ctc g ot ttt gg cct g a cc gg ct g c gg 320 
gg . g aa ggag g8gg ttca g c . g tc g ctc g a ,cc gg t gg c t t«cae g act c tg ao g . gtg ,00 
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tcacgtgtgc ggcatgatca tcactgactt ccccggcccc aagggccagg cggtcgaaaa 240 
gcggggagtg aagaaatttt gttccaccgc cgaaatgctt ggttggtggc tgcagccgga 300 
aaaccgtctg ctcgatgcca agctctacgt ccacgacatg gggcgcagcg tttgggaaaa 360 
gccggatgac ggtcatctga tcgacgcaac cagcgcctac tatgtggtcg gtacgtcact 420 
caaaggcgcc atgggcgcgt cgcttgcaag ctttgccgag gagcaggacg ccaaggcgct 480 
tgccggcatg cacggcggtc gtgtgctgcg cttcgaggaa atcgatcagg cgctgctgca 540 
ggaggctgca agcatgcagc acggcggcat gcacgaccat gcgccaaacg' gtgcacataa 600 
cgcacacgca ggccactgag cagcagtggt ctgaacagca cacacaagaa atcgaggtaa 660 
gcacaatgat gggtatcagc gtctggcaac tcctgatcat tcttctgatc gtcgtcatgc 720 



<210> 40 
<211> 127 
<212> DNA 
<213> Unknown 

<220> 

<223> Description of Unknown Organism: Unknown 
<220> 

<221> UNSURE 
<222> (9) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (101) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (119) 

<223> May be any nucleic acid 

****** ^•» ,cw c,cctc,toc 60 

tcgccctcct ,ccct„,t, csgcccccc, g9 t«o, g cg ncccctcc cc=g<,c,tnc 120 

127 

ccgcgct 



<210> 41 
<211> 6858 
<212> DNA 
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<213> Unknown 
<220> 

<223> Description of Unknown Organism: Unknown 
<400> 41 

actcgccaag tgatcgaccg gcccctgagg gccgcgacgc agagggcgcc ccgtgcactg 60 
gcacaggcgg ccttgtgcgt tagactctga tattcgtgcg ccctctcgtt ggcaggacca 120 
tccatcctgt -gtgccggggg ccgcgcacac cgatcccgga tccgcctcgg ccctgccctg 180 
cgcgcccctc cgttctcgac ctccccgacg ctgtctgaac acgcgtcgcc gggggacgac 240 
ggcgggcggc ccgcctcggg ggaggggtaa gcgtcccggg atgcccgttc aaccgttccg 300 
caaggctcgc ccatcgtggg ggagaaccgg cgcgacgcta ggagagacaa gtgatccagc 360 
aggagtcgcg gctcaaggtc gccgacaaca ccggtgcgaa ggaaatcctg accatccgtg 420 
tgctcggcgg ttccggacgc cgctacgcag gcatcggcga caccatcgtc gccaccgtga 480 
aggacgccat ccccggcggc aacgtcaaga aggcgcacgt cgtcaaggcc gtggtggtcc 540 
gcacccgcaa gcagtcccgc cgtcccgacg gctcgtacat caagttcgac gagaacgcgg 600 
cggtcatcct gaagaccgac ggcgagcccc gtggcacgcg catcttcggc cccgtgggtc 660 
gcgagctgcg tgacaagaag ttcatgaaga tcgtgtcgct cgccccggag gtgatctgac 720 
ctcatggcca agatcaagaa ggacgacctc gtgcaggtca tcagtggcaa ggacaagggc 780 
aagcagggca aggtcctgcg cgtgttcccg acggatgagc gcgtgctcgt cgagggcgtg 840 
aaccgcgtga ccaagcacct gcgcgccggc caggacaaca acggttccac cgagggcggc 900 
ctgcaggtcg tcgaggcccc gatccacatc tcgaacgtgg ccgtggtgga cccggagacc 960 
aagaagccga cccgtgtggg ctaccgcttc gagaccgtcg agaaggacgg cgtgacgaag 1020 
accgtgaagg tccgcttcgc caaggcctcg gggaaggagc tgtgatgacc gaggtgcagc 1080 
agaccgagaa ggtcaccccg cgtctgaaga ccaagtaccg cgaggagatc cgcggacgcc 114 0 
tgcaggagca gttccagtac gggaacgtca tgcaggtgcc gggcctcgtg aaggtcgtcg 1200 
tcaacatggg cgtcggcgag gccgccaagg actccaagat catcgacgac gccgtcaccg 1260 
acctcaccgc catcaccggc cagaagccga tgatcaccaa ggcccgcaag tccatcgcgc 1320 
agttcaagct gcgtgagggc atgcccatcg gcacgcacgc caccctccgt ggcgatcgca 1380 
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tgtgggagtt cctggaccgc ctggtcacgc 
gcctgtccga ccgccagttc gacggcaacg 
ccgtgttcca cgagatcgat caggacaaga 
tggtgacgac cgccaagaac gacgacgagg 
cgttcaagac cgaccagtaa gacctccacg 
ggaaaccacg gcgagaaagg gcgtgaagca 
gctgacccgt ctgcgcaacg caaactcggc 
gaagctgaag actcgcgtcg ccgagatcct 
cgaggaggag gccgaggtcg gcaagaagct 
tgagcgtgcg atcgccggcc tgcgccgcat 
gtccacgaac ctgccccacg tgctgggcgg 
tggtctcctc acgaaccagc aggccgccaa 
ctacgtctgg tgacgggcaa gacggaagaa 
tctcccgatc accatccccg ccggcgtcga 
gaagggcccc aagggcccca agggtcagct 
caccctcgag gaggggcagg tcaccgtggc 
cctgcacggt ctgacccgta ccctcatcag 
ctccaagcag ctcgaggtcg tcggcaccgg 
cgagttcgac ctgggctact cccaccccgt 
cacggtggag ggtaacaggg tcaccgtcgc 
gaccgccgcc aacatccgca agctgcgccg 
cgcgggcgag cagatccgcc gcaaggccgg 
ggcaagggca agttcaacgc ccgcacccgc 
ggcaccacgt ccgtcccccg cctcgtcgtc 
gtcgtggacg acacgcagag ccgcacgatc 
cgtgcgctcg agggtgacaa gacggccaag 
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tgccgctgcc gcgcatccgt gacttccgcg 144 0 
gcaactacac cttcggcctg tccgagcaga 1500 
tcgaccgcgt gcgcggcatg gacatcaccg 1560 
gccgcgcgct gctcaaggcg ctgggcttcc 1620 
ccacaggtcc tccaccggtg aaccggtggc 1680 
catgaccatg accgatcccg tcgcagacat 1740 
ctaccacgac accgtgtcca tgccgtcctc 1800 
caaggccgag ggctacatcc aggactggcg 1860 
gaccatcgac ctgaagttcg gcccgcagcg 1920 
ctccaagccg ggcctgcgcg tgtacgcgaa 1980 
cctcggcatc gccatcctgt ccacctcctc 2040 
gaaggctggc gtgggcggag aagtcctcgc 2100 
aggctgaact gacatgtctc gaatcggacg 2160 
tgtgaccatc gacggcgacc gcgtctccgt 2220 
cgagcactcg ctgcccacgc ccatcacggc 2280 
ccgccccgac gacgagcgtg agtcccgctc 2340 
caacatggtc gagggcgtga ccaacggctt 2400 
ctaccgcgtg caggccaagg gccaggacct 24 60 
cccggtgaag gtgtcccagg gcatcacctt 2520 
cggtatcgac aagcagcagc aggtcggcga 2580 
ccccgacccg tacaagggca agggcgtcta 2640 
aaagaagtga tgtctactct gaaggtgaag 2700 
cgccacctcc gggtgcgcaa gcggatctcc 2760 
aaccgctctg cacggcacat gttcgtgcag 2820 
gcgtacgcct ccaccatgga ggccgacgtg 2880 
gccaagcgcg tgggcgagct cgtcgccgag 2940 
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cgtgccaagg cggccggcat cgaggccgcg gtcttcgacc gggcgggcaa caagtaccac 3000 
gggcgcgtcg cggccgtggc cgacggtgcg cgagagggtg ggctgcagct gtgaccgaga 3060 
acatcaacca gaaggacact caggtgaccg agagcaccga gaccaccgtc tccgagaccg 3120 
ggtcgggctc gcgagccaga ccaccgagcg cgccaccggt ggccgcggcg gtcgcgacgg 3180 
cggccgcggt ggccggacgg cgatcgtcgt ggcggccgtc ggacgaccga accgtcgtgg 3240 
cgcccaggac gacgaggaag gaccagttcc tcgagcgcgt cgtgggcatc aaccgcgtct 3300 
ccaaggtcgg ccgccgcttc tccttcaccg ccctcgtggt ggtgggtgac ggcgacggca 3360 
ccgtcggcgt cggctacggc aaggcgaagg aggtccccgc tgcgatccag aaggccgtgg 3420 
aggaggccaa gaagtccttc ttccgcgtcc cccgcgtcgg ctccaccatc ccgcacctgg 34 80 
tgcagggtga ggacgccgcc ggcgtcgtgc tgctccgccc ggcctccccg ggtaccgcgg 3540 
tgatcgccgg cggtccggtg cgcgccgtgc tcgagtgcgc cggcatccac gacgtgctct 3600 
ccaagtccat gggctccgtg aacgcgatca acatcgtgcg cggcacggtg gagggcctca 3660 
agaagctgaa gagcccccag gccgtcgccg cccgccgcgg caaggccctg gacgagatcg 3720 
ccccccatgc gatgctgcgc accatggaga acgatcgcgc ccagaagagc gcgaaggcag 3780 
gtgcgtgacg cgtgtttgag tccactcgca agaacatcca gccctcggac gccaccctgg 3840 
tcatcaccca gacccgcggc gtcacgggct ccaagcagaa ccatcgggac accctgcgct 3900 
cgctgggcct gaagcggatc ggccaccagg tcacccgcaa ggccgacgcg gtgacggtcg 3960 
gcatggtcaa caccgtgccg cacctggtgt ccgtggagga "ggtcaacaat ggctgacaac 4020 
gacgccatca aggtccacga cctgcgtccg gcccccggtg ccaagaccgc caagacccgc 4080 
gtgggtcgcg gtgaggcgtc gaagggcaag accgccggtc gcggcaccaa gggcaccaag 4140 
gcccgttacc aggtccgtgc gggcttcgag ggcggtcagc tgcccctgca gatgcgtctg 4200 
ccgaagctcc gcggcttcaa gaacccgttc cgcacggagt accaggtcgt gaacctggac 4260 
aagctctccg cgcacttccc cgagggcggt gaggtcaccg tggacgcgct cgtctccaag 4320 
ggcctcgtcc gtcgtggcca gcccgtgaag gtgctgggca cgggggagat caccgcggcc 4380 
gtgcaggtga aggcgaacgc cttctctgcg tccgccgtgg agaagatcca ggccgccggc 4440 
gggtccaccg agaccctctg acacgccgac ccatcgaccg agggccctgg ccggagcagc 4500 
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cgctcgggcc aggccctggt ccgtccgtgt 
cgtctcgtgc ccccgccccg cggaacggcg 
tcgtgctcaa ggccatcgcc cggatcgtcc 
tcacgctcgg gctcatcgcc gtctatcgga 
actacccggc ggtgcagcag tgcctggcag 
tcgtgaacat gttctcgggc ggggcgctcc 
tgccgtacat cacggcgtcg atcatcgtgc 
agcagctcca ccaggagcgc cgcaggggcc 
tgaccctcgc cctcgccctg ctgcaggcga 
ccctgctcgg atgcagcctg ccgctgctgc 
tggtcatcgc cctgaccacc ggctgtctca 
agaacggcgt gggcaacggc atgtccctgc 
cggccggtct cggccaggtg gtccagacgc 
ggatcggcct gctcaccatg ctggccatcg 
cggtccagta cgccaagcgg cagatcggct 
tcccggtcaa ggtgaacatg gccaacgtca 
tgctcccggg catcctcatc cagttcaaca 
ggatcacgtg gctgagccgg tacttcggct 
acttcctgct catcatcggc ttcacgtact 
agatctcgga caacatgaag cgctacggcg 
ccaccgagcg ttacctgcag tacgtcatca 
acctcggtat cgtggccatg atcccgctga 
acttcccgct cggcggcacg tccatcctca 
agcaggtcag cgcacagatg gagcagcgcc 
cccgatcccg caacgccgtc cgtatcgaca 
tcatgggccc tcccggttcc ggcaagggca 



-182- 

agactcgcac agccgccccg gtgtggccgc 4560 
cacgccccac aggaccagcc gcaggaggac 4620 
ggacgcctga cctgttgcgg aagatcgcct 4680 
tgggcgactt cgtgccggcc accggcgtgg 4740 
cgggcaacgc ccagggcggc ctgtactcct 4800 
tgcaggtgtc tgtcttcgcg ctgggcatca 4860 
agctgctgcg cgtggtgatc ccgcgcttcg 4 920 
aggcgacgct gacgcagtac acccgctacc 4 980 
ccacgatggc ctcgctggcc cgcaccgggg 5040 
gcgacggctc catcctcacg gtgctgctcg 5100 
tcgtcatgtg gttcggggag cggatcaccg 5160 
tcatcttcac ctccatcgcg gcaggcttcc 5220 
agggctggcg cgtgttcgcg atcgtcatgg 5280 
tcttcgtgga ggagtcgcag cgccggatcc 5340 
cacggaccgt gggcgggtcg agcacctaca 5400 
tcccggtcat cttcgcctcc tccgtgctga 54 60 
cgccgcagga cggcagtgcg ccggccccgt 5520 
ccggtgacca cccggtgtac atggccctgt 5580 
tctacgtgtc catcacgttc aacccggtgg 5640 
gcttcatccc ggcgtccgcg ccggccggcc 5700 
gccgcatcac gttcgtggtg ggggccctct 5760 
tcgccttcgc ggtgatcggc accagccaga 5820 
tcatggtggg cgtcggcctc cagaccgtga 5880 
actacgaggg cctgctgcgc tgagccccga 5940 
gtgaggaaca cacgatgacc cgcatgctgc 6000 
cccaggccac ccggatcgcc gacaagctgg 6060 
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ggatcgtccc gatctccacc ggtgacatct tccgccacaa cgtgaagtcg atgacgccgc 6120 
tcggcgtcga ggccaagagg tacatcgaca acggcgactt cgtccccgat gaggtcacga 6180 
accgcatggt cgccgaccgc atcgcccagg ccgacgcgga gcacggcttc ctgctggacg 6240 
gctacccgcg cacgaagggc caggtcgagg cgctggacgc catgctcgcc gaggccggcc 6300 
agtcgctgtc cgccgtcgtc gagctggagg tgcccgacga ggagctcgtg gagcgcctgc 6360 
tcaagcgtgc cgagatcgag ggccgcgcgg acgacaccca ggaggtcatc gagcaccgcc 6420 
tggacctgta ccaccgcgag accgagtccg tcatccagga gtacgtggag cgcggcatcg 6480 
tcgcccgcgt ggacggcacc ggccagatcg acgacgtcac cgagcgcctg ctgcaggccg 6540 
tgtactccgt gcgctccgcc acgggctccc tgcccgtgat ccagccgggc gcggagtcct 6600 
gaccccgtga tcggccgccg ctcgctcgag ctcaagaccg ccccccagct gctggccatg 6660 
cagcgcgcgg gggtggtcct gtccgaggca ctggacgccg cgctggccgg cgcgccgggc 6720 
ttcaccaccg cggagctgga cgccgtgttc gcggtggtgc tggccgaacg cggtgcgacc 6780 
tccaacttcc tgggctacta cgacttcccg gcctcgatct gcacctcggt caacgaggag 6840 
gtcgtgcacg gcatcccc 6858 



<210> 42 

<211> 578 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> UNSURE 
<222> (5) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (23) ' 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (31) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (48) 

<223> May be any nucleic acid 
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<220> 

<221> UNSURE 
<222> (211) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (292) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (308) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (350) 

<223> May be any nucleic acid 
<220> 

<221>. UNSURE 
<222> (384) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (477) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (507) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (529) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (549) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (551) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (558) 

<223> May be any nucleic acid 
<400> 42 

ttctngtcta tggcagagat ggncaggttg 



-184- 



r 



ncgttgagca ggtactgncc atcagccgtc 60 
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ttcagcgcca 


ggtagttccc 


atcgttctgc 


acacccgggt ggctccgctg cttcacgtca 




atattagtgg 


caccagctgg 


gatggtgaca 


atgtcattgt agccataatt ggtgggggtg 


180 


agggacccgg 


agaccttcct 


gcaggagttg 


nctttgcccc cacacacccc gcatttgtcc 


240 


agcttccgag 


gcgagtccac 


cacatggtca 


cagccggcct tgacacactg gncacggaca 


300 


cagatggnca 


gtgtttctgg 


cccacacagg 


gtgccatcaa tcaccttggn ctcgaacact 


360 


ttggaactcg 


ctcctccccc 


gggntcggga 


ggaacaactt gcaggggtcc cgggggggac 


420 


aacccagcat 


tcttggggga 


cccactgcag 


gaggattccc cgtccatgtc aagtgtnatt 


480 


ggtgggcatt 


attcttctca 


caattgntgc 


tccctgaagg ttttcccgnc aaggggggat 


540 


tcccccccng 


ntggaatnat 


tggtacttgg 


gtctccga 


578 



<210> 43 

<211> 305 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> UNSURE 
<222> (128) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (146) 

<223> May be any nucleic acid 
<400> 43 

catttaagtt tgctagtcct ttgcaaacag actgacgctg agtgtcctgt ctgagtcaat 60 
aagtgcactt ttacctttta acctatgccc tctacttgaa cccgagcaag gtccagtcca 120 
ctggacangt tgatgatagg gtctgncgcc ccataccctc tcctcttccc ccttaggaat 180 
ttgtgcagta ctggaggggt tgcggcaatg ggaggcctgg gtgggccgtg ctgccttgat 240 
atggccaagg gacccagtca ccacagtgga gacccttgtc tgcacctcag taccgcatgt 300 
ccagg 305 



<210> 44 

<211> 333 

<212> DNA 

<213> Homo sapiens 
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<220> 

<221> UNSURE 
<222> (82) 

<223> May be any nucleic acid 



<220> 

<221> UNSURE 
<222> (255) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (275) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (299) 

<223> May be any nucleic acid 



<220> 

<221> UNSURE 
<222> (313) 
<223> May be 

<220> 

<221> UNSURE 
<222> (324) 
<223> May be 



any nucleic acid 



any nucleic acid 



<400> 44 
ggcacaggtg 


actttagcat 


gcagagcagc aaagagagag 


caaccaccaa 


catcatccag 


60 


ccgctgctcc 


acgcacagtg 


gntgctgggg gactggtctg 


agtgctctag 


cactgegggg 


120 


ccggctggca 


gaggcgaact 


gtagagtgca gggacccctc 


eggtgeagge 


ctctgccacc 


180 


tgcaacaagg 


ctctggaaac 


ccgaggatgc caagccctgg 


cagaaccagc 


tgtgccccct 


240 


gtgatttcag 


ggggncaggg 


gccattttgt gctcngggac 


atgcggtaat 


ggaggttgnc 


300 


agacaaggtc 


ttncattgtg 


gtgnatgggt tec 






333 



<210> 45 
<211> 102 
<212> DNA 
<213> Unknown 

<220> 

<223> Description of Unknown Organism: Unknown 



<400> 45 
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-187- 

<220> 

<221> UNSURE 
<222> (64) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (69) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (71) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (72) 

<223> May be any nucleic acid 

gcagcagcag cgcagcgcag agagagcagc agcagcagca gcagcagcag cagagcagat 60 
cntnctggna nnaaaaaatc gcggcagcag ctgctctagc ag 102 



<210> 46 
<211> 123 
<212> DNA 
<213> Unknown 

<220> 

<223> Description of Unknown Organism: Unknown 
<220> 

<221> UNSURE 
<222> (9) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (51) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (52) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (57) 

<223> May be any nucleic acid 
<220> 
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<221> UNSURE 
<222> (67) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (123) 

<223> May be any nucleic acid 



<400> 46 

caggcaagnc ggcacgtagg agcagcagca 
agggggngcc cgggacccaa ggcgcccgaa 
cgn 



gcagcagcag cagcagtaac nnagtcnacg 60 
cagagaggcg gagcacaatc cactggtcgg 120 

123 



<210> 47 
<211> 109 
<212> DNA 
<213> Unknown 

<220> 

<223> Description of Unknown Organism: Unknown 
<220> 

<221> UNSURE 
<222> (87) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (95) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (102) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (106) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (107) 

<223> May be any nucleic acid 
<400> 47 

ggcacgcagg agcagcagca gcagcagcag cagcagcagc agagagagag cagcagagag 60 
agagagcagc agagcagagc agagcanagt agagnagagc anagcnnac 109 
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<210> 48 

<211> 293 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> UNSURE 
<222> (86) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (166) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (185) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (209) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (214) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (219) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (234) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (290) 

<223> May be any nucleic acid 



<400> 48 

ggcacgaggg ggaaactgct ccgcgcgcgc 

gtccgggccc ggccgggcat ggattnaatg 

cttcccttgc tgctgctgct gctgctgctg 

gccgnagctg aggagaacga cttgggttng 



cggggaggag gaaccgcccg gtcctttagg 60 
cctgagcccg ggtcccgctg tcttctgctt 120 
ctgccggccc cggagntggg cccgagccag 180 
cctncccana aaatgggaag gganttgggg 240 
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293 



<210> 49 

<211> 506 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> UNSURE 

<222> (283) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 

<222> (342) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 

<222> (356) 

<223> May be any nucleic acid 

. <220> 

<221> UNSURE 

<222> (362) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 

<222> (364) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 

<222> (368) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 

<222> (429) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 

<222> (454) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 

<222> (461) 

<223> May be any nucleic acid 

<400> 49 

aattcggcac gagcacccgg ccactgcagt cttctgccct gctggacagc agcagcagca 60 
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gcagcagcag cagcagcagc agcagcaaca 
cttctacctc ctttgagccc atcaaggcag 
agctgtcaga aatctttgat cccacacgag 
tgatgtcctc agaagtgttt gcccctctgc 
gattatatct acaacctgga cgagagtgaa 
tntnaacntt tgactgacag ggacatgcct 
gggggtttnc ttgggacttt tcaaccgacc 
ttcggcttcc ttaacgaaag ttttgg 



<210> 50 
<211> 419 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> UNSURE 
<222> (137) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (221) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (259) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (327) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (385) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (389) 

<223> May be any nucleic acid 



-191- 

gtaacagcag cagttcgtcc ggacccaacc 120 
accccacagg tgttttggaa ctccccaaag 180 
agtgcatgag ctcggagctg ctggaggagt 240 
tttcgtcttt ctncaccccc gggagaccac 300 
ggtgtttgtg anctcttttg atgtgnctgt 360 
tttttggttg ggacccagat tttttgactt 420 
ctanagagtt nagagcaaan aggttggttt 480 

506 



<220> 

<221> UNSURE 
<222> (416) 
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-192- 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (418) 

<223> May be any nucleic acid 



<400> 50 

tttaagcacc aaaacttgtg ttttaatgat gttggatgga aatctttcct aaatgtgtca 60 
tgcatgctct tgtctccctt aatggagaga gtgtgacact gcttagcact tggatggctt 120 
ggggtggtgg ttatgancag cagtctgtca cagctcagcg aggtgaagcc tgtgggcgtt 180 
ttgctctgtg ctgaatggct cagtggccct acaaagcgga ntcagctctt ggtggctttc 240 
tgttgtggtg ggctgctgnt gctgctgctg ctgctgctgc tgctgccctt gcctctaaaa 300 
gaactcactt cctcttcctc ctgctgncac ctgtcttttg' gcttgtggga ttggagtcat 360 
ggggcccaga tggagccttg ctccntgant tatgataggc ccctcggtct cttttntnc 419 



<210> 51 
<211> 495 
<212> DNA 

<213> Saccharomyces cerevisiae 
<220> 

<221> UNSURE 
<222> (177) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (322) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (328) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (342) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (368) 

<223> May be any nucleic acid 
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<220> 

<221> UNSURE 
<222> (371) 

<223> May be any nucleic acid 



<220> 

<221> UNSURE 
<222> (375) 
<223> May be 

<220> 

<221> UNSURE 
<222> (380) 
<223> May be 

<220> 

<221> UNSURE 
<222> (386) 
<223> May be 

<220> 

<221> UNSURE 
<222> (396) 
<223> May be 



any nucleic acid 



any nucleic acid 



any nucleic acid 



any nucleic acid 



<220> 

<221> UNSURE 
<222> (404) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (423) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (426) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (436) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (443) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (456) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (460) 
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-194- 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (467) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (468) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (471) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (474) 

<223> May be any nucleic acid 



<400> 51 

aattcggcac gagcaaagtt ctgcgctcca 
tccatctggg ggacacagtg gactctgatc 
cagaggtgca gctgaaggag gaatctgctg 
agtccctggg actcccacct ccgagccagc 
gtctgctctt ccaccacctc ttgcacaaag 
ccggagttcc ttccttgcct tnaggggntt 
tgggaaantt naggnagttn aaggantacc 
ttnacnaaaa attttnaaca gtntaaattg 
ttttttcggg tccgg 



ttgtgggcat caaacgacac gtcaaagccc 60 
agttcaagcg ggaggaggat ttctactaca 120 
ctgctgctgc tgctgctgcc gcagacnccc 180 
tcccaccccc agcatgactg gcctgcctct 240 
cccagtcctc cggcccagaa catcctgggc 300 
ttcagcaagt tnagttcctt gggtcctttt 360 
aggttnttgc catnctttcc agatccaagt 420 
ggtttnttgn ccctttnngg nggntgtttt 480 

4 95 



<210> 52 
<211> 81 
<212> DNA 
<213> Unknown 

<220> 

<223> Description of Unknown Organism: Unknown 
<220> 

<221> UNSURE 
<222> (65) 
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<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (67) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (71) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (75) 

<223> May be any nucleic acid 
<400> 52 

ggcacgcagg agcagagcag cagcagcaga gagagcagca gcagcagcag cagcagcaga 60 
gagananata natanatata t 81 



<210> 53 

<211> 305 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> UNSURE 
<222> (11) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (62) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (81) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (256) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (289) 

<223> May be any nucleic acid 
<400> 53 

aggcacttga nttgaaaatg gaaaacccta ctgctggtgg tgctgcggtg atgaggccta 60 
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tnatgcagcc ccagggtttt nttaatgctc aaatggtcgc ccaacgcagc agagagctgc 120 

taagtcatca cttccgacaa cagagggtgg ctataatgat gcagcagcag cagcagcagc 180 

aacagcagca gcagcagcag cagcagcagc aacagcaaca gcaacagcaa cagcagcaac 240 

agcagcaaac ccaggncttc agcccacctc ctaatgtgac tgcttcccnc agcatggatg 300 

ggctt 305 

<210> 54 
<211> 307 
<212> DNA 

<213> Hepatitis C virus 



<220> 

<221> UNSURE 
<222> (212) 

<223> May be any nucleic acid 
<400> 54 

tggggtgtga agctccggtg ctggtgcggc gggggactgc ggggccagcc tcagtttaaa 60 
ccccctcagc agtctttctg tcgttgccct ccacactgcg agactctgga gggcgatctg 120 
gaggtctgga agataaccga ttcctgggag atttgggggt agtctccaat ctgtccctgg 180 
ctcatcttgt gacccgaagc cggcggcctt gncaggagta ttctagaatg agtgcacata 240 
aaaatacctt caaacggtag cagcagcagc agcagcagca gcagcaagca gcagcagcag 300 
cagcagc 307 



<210> 55 
<211> 88 
<212> DNA 
<213> Unknown 

<220> 

<221> UNSURE 
<222> (6) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (7) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
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<222> (78) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (83) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (87) 

<223> May be any nucleic acid 
<220> 

<223> Description of Unknown Organism: Unknown 
<400> 55 

ggacanngac tactctctct ctctctctct ctctctctgc tgctgctgct gtgctgctgc 60 
tgctgctgct gctgccgntg tgngcana 88 



<210> 56 
<211> 346 
<212> DNA 
<213> Unknown 

<220> 

<221> UNSURE 
<222> (278) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (288) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (299) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (313) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (342) 

<223> May be any nucleic acid 



<220> 

<223> Description of Unknown Organism: Unknown 
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<400> 56 

ggcacagccc aactggtgat gctgctgctg 
ctgatactct agtggggctg gaagggtggt 
gagggaaaaa aaaaaccggc agccactgct 
cggtcacaag gaaaatggat tcagtttgca 
tctcagcatg ggcttccagg gcagcgattg 
agatgctgag acntcgtgac tccaggataa 



-198- 

ctgctgctgc tgccgccgcc gcctctattg 60 
tcctattcgc accatcgcca accagagaca 120 
gaatgttggg ttcggaggct gcatccgact 180 
tctctccctc ctttaaacag cttctccggg 240 
aggagacntt accaaggngc accacacant 300 
gaaacattaa cngggg 346 



<210> 57 
<211> 496 
<212> DNA 
<213> Unknown 

<220> 

<223> Description of Unknown Organism: Unknown 
<220> 

<221> UNSURE 
<222> (11) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (78) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (195) 

* 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (197) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (286) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (291) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (293) 

<223> May be any nucleic acid 
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-199- 

<220> 

<221> UNSURE 
<222> (315) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (328) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (329) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (344) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (346) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (352) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (354) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (358) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (366) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (399) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (406) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (410) 

<223> May be any nucleic acid 
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-200- 



<220> 

<221> UNSURE 
<222> (418) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (420) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (435) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (443) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (453) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (454) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (459) 

<223> May be any nucleic acid 



<220> 

<221> UNSURE 
<222> (471) 

<223> May be any nucleic acid 



<220> 

<221> UNSURE 
<222> (473) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (474) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (481) 

<223> May be any nucleic acid 
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<400> 57 

gaattcggca naggtgcaca gatgtggtgg atggggaggg ccgcacggga cagaagttct 60 
ccctgtgtat tctgacgnct gagaaaggag catttcatcc gggcggagac caaggagatc 120 
gtcaatgggt ggctggagat gctcatggtc tatccccgga ccaacaagca gaatcagaag 180 
aagaaacgga aagtngnagc cccccacacc acaggagcct gggactgcca agttgggctg 24 0 
ttaccagcag cagcagcagc agcagcagca gcagcagcat ccccantgct ntnggaaagt 300 
tcccaccacc aagtnccaca atttgggnna aaaccaaggt tgtngnagac gngntttngg 360 
gatttnggca ttgtgggttg cttgcatgga aggacattng gttgtnggtn ccttggangn 420 
tacaattacc atttncggtt gtnaaggtta aanntccgnc attcagaagg ntnnaaggtg 480 
ntttgaagtc catttg 4 96 



<210> 


58 








<211> 


268 








<212> 


DNA 








<213> 


Drosophila 


sp. 




<220> 










<221> 


UNSURE 








<222> 


(16) 








<223> 


May be 


any 


nucleic 


acid 


<220> 










<221> 


UNSURE 








<222> 


(51) 








<223> 


May be 


any 


nucleic 


acid 


<220> 










<221> 


UNSURE 








<222> 


(60) 








<223> 


May be 


any 


nucleic 


acid 


<220> 










<221> 


UNSURE 








<222> 


(202) 








<223> 


May be 


any 


nucleic 


acid 


<400> 


58 









aacacttatc cttganagct ctgtttggga agcaggacaa agctacatgt haggaaactn 60 
tggagcctcc gcagactctc caccagcagc agcagcagca gcagcagcag caagagaagc 120 
ttccaattag gcagggggtt gtacgctccc tgtcctatga ggaacccaga agacactcac 180 
cccccattga gaagcagctc tntccagcca ttcagaaact catggtcagg agcgcagacc 240 
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tccacccatt gtcagagctg cctgaaaa 268 



<210> 59 

<211> 471 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> UNSURE 
<222> (249) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (386) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (449) 

<223> May be any nucleic acid 
<400> 59 

tcgacccacg cgtccgctga ggaacagacg ttccctggcg gccctggcgc cttcaaaccc 60 
agacatgctg ctgctgctgc tgctgctgcc cctgctctgg gggacaaagg ggatggaggg 120 
agacagacaa tatggggatg gttacttgct gcaagtgcag gagctggtga cggtgcagga 180 
gggcctgtgt gtccatgtgc cctgctcctt ctcctacccc caggatggct ggactgactc 240 
tgacccagnt catggctact ggttccgggc aggagacaga ccataccaag acgctccagt 300 
ggccacaaac aacccagaca gagaagtgca ggcagagacc cagggccgat tccaactcct 360 
tggggacatt tggagcaacg actgcnccct gagcatcaga gacgccagga agagggataa 420 
ggggtcatat ttctttcggc tagagagang aagcatgaaa tggagttaca a 471 



<210> 60 
<211> 379 
<212> DNA 
<213> Unknown 

<220> 

<223> Description of Unknown Organism: Unknown 
<220> 

<221> UNSURE 
<222> (2) 

<223> May be any nucleic acid 
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<220> 

<221> UNSURE 
<222> (14) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (31) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (135) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (315) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (332) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (349) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (357) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (374) 

<223> May be any nucleic acid 



<400> 60 

anttcggcan aggnaaggga gagggtgacc 
gttcaggagg catggagctg acaaccatga 
cgccaccacc gtagncagca gcagcagcag 
ggaatagaga cagccagaga gaaatgtgat 
ttcttcagag gggacgggtg atgggcagat 
atcttttctt tttcntaaaa ttgttgttat 



ngcatcccaa ctagatttca gtggagtgaa 60 
ggcctcggca gccaccgcca ccaccgccgc 120 
cagcagcagc aagagttaac tctgacttag 180 
caatgaagga gacatctgga gtgtgcgtgc 240 
ttggaaaaag caccgcagat tgggaacctt 300 
gnaaatttgg gtttttccng taacttntta 360 
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aaaacttaaa agtnggttt 379 



<210> 61 
<211> 255 
<212> DNA 
<213> Unknown 

<220> 

<223> Description of Unknown Organism: Unknown 
<220> 

<221> UNSURE 
<222> (121) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (183) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (254) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (255) 

<223> May be any nucleic acid 



<400> 61 

aattccgaca atggaaagca ctcttagcct 
atttcagcat tctttattac ccggcacgct 
nactgccagg gtcagacagt ccatttgctg 
ggnatggcat tagggaagct gctgtctgag 
aatgttcttt cctnn 



tgcagtggtc tacattttta aggaaccaat 60 
gtgtcctttg tcagagttca agtttatggt 120 
ctgctgctgc tgctgctgct ttctcgaact 180 
tgttagggaa tgtcttggct aagtaaagcc 240 

255 



<210> 62 
<211> 5289 
<212> DNA 
<213> Unknown 

<220> 

<223> Description of Unknown Organism: Unknown 
<400> 62 

cgagctctcc cagccgcagc ctccgaatcc acggcctcca ccccgcgcct ctccagcgct 60 
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ctatcccgtc gctgcgccct tgtcgccggc 
tccttgctgg gcacaaatag ctccaccatg 
ctgttgcatg cctgcggctc caaccgcatt 
gacatctttg aactcaccgg agctgcccgc 
cctgaccctt ctagcccagc tttccgcatc 
gacaagaagt tccaagacct agtggatgct 
gcctccctga ggcaaatgaa gaagacccgg 
cactctggcc aggtcttcag cgtgatctcc 
ctgaccgtgc aggggaagca gcatgtggtg 
cagtggaaga gcatcaccct gtttgtgcag 
gagaagatgg agaatgcgga gctggatgtc 
gccagcatcg ccaggctccg cattgccaaa 
ctgcagaatg taaggtttgt ctttggaacc 
tgctccagct ctaccagtgt ctttgtcacc 
cctgccatcc gcaccgacta cattggccac 
atctcatgtg acgagctgtc cagcatggtc 
accacgctgc aggacagtat ccgcaaagtg 
ctgaggaggc ccccactctg ctaccacaac 
acggtggaca gctgcactga gtgtcgctgc 
tcctgtccca tcatgccctg ctccaatgcc 
tgctggccca gcgactctgc agacgatggc 
tctgtgacct gtggcaatgg aatccagcag 
agatgcgagg gctcctctgt gcagacgcgg 
tttaaacagg atggcggctg gagccactgg 
ggagacggtg tgatcacaag gatccggctc 
aagccatgtg agggcaaagc ccgggagacc 



cccggccgct gcatccgcgt ccgcacaggc 120 
gggctggcct ggggactcgg tgtcctgctc 180 
ccagagtctg ggggagacaa cagtgtgttt 240 
aagcggtctg ggcgccgact ggtgaagggc 300 
gaggatgcca acctgatccc ccctgtgcct 360 
gtgcgggcgg agaaaggttt cctcctcctg 420 
ggtaccctgc tggctgtgga gcggaaagac 480 
aatggcaagg cgggcaccct ggacctgagc 540 
tcggtggaag aagcactcct ggcgactggc 600 
gaggacaggg cccagctgta catcgactgt 660 
cccatccaga gcatcttcac cagggacctg 720 
ggaggtgtca acgacaattt ccagggggtg 780 
acaccagaag acatcctcag gaacaaaggc 840 
cttgacaaca acgtggtgaa tgggtccagc 900 
aagacaaagg acctgcaagc catctgtggc 960 
ctggagctca ggggtctacg caccatcgtg 1020 
accgaagaga acaaagagct ggccaacgag 1080 
ggagtgcagt acaggactgg cgacgagtgg 1140 
cagaactcag ttaccatctg caaaaaagtg 1200 
acagttccgg atggagaatg ctgcccacgg 1260 
tggtccccgt ggtctgagtg gacctcttgc 1320 
cgtggccgct cctgcgacag cctcaacaac 1380 
acctgccaca tccaggagtg tgacaagaga 14 40 
tccccatggt catcttgctc cgtaacatgt 1500 
tgcaactccc ccagccccca gatgaatggg 1560 
aaagcctgcc agaaagactc ctgccccatc 1620 
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aatggaggct ggggaccttg gtcaccatgg 
gtacagaaac gtagccggct ctgcaacaac 
gttggtgatg tgacagaaaa ccagatctgc 
ctgtccaatc cctgctttgc tggtgtccag 
tgtggtgcct gtcccccagg ctatagtgga 
tgcaaagaag tccctgatgc ctgcttcaac 
gaccccggct acaactgcct gccctgccca 
c 9g<jgcgtgg aacatgccac cgccaacaag 
gacgggacac acgactgcaa caagaacgcc 
cccatgtacc gctgcgagtg caagcctggc 
gacacagacc tggacggctg gcccaatgag 
cactgcagaa aggataattg ccccaacctt 
gatggaatcg gcgatgcctg cgatgatgac 
gacaactgtc cattccatta caacccagcc 
gaccgctgtg acaactgccc ctacaaccac 
ggggaaggag acgcctgtgc agctgacatt 
aactgccagt atgtctacaa tgtggaccag 
cagtgtgaca actgccccct ggaacacaat 
attggagaca cctgtgacaa caatcaggat 
gacaactgtc cctacgtgcc caacgccaac 
gatgcctgtg accatgatga cgacaatgat 
ctggtgccca atcctgacca gaaggactct 
gatgattttg accaggacaa ggtgccagac 
atcagtgaga ctgatttccg ccgattccag 
cagaatgacc ctaactgggt tgtacgccat 
tgtgaccctg gacttgctgt aggttatgac 
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gacatctgtt ctgtcacctg tggaggaggg 1680 
cccaaacccc agtttggagg caaggactgc 1740 
aacaagcagg actgtcccat tgacggatgc 1800 
tgtaccagct accctgatgg cagctggaag 1860 
gatggagtcg agtgcaaaga cgttgatgag 1920 
cacaatggag agcacaggtg tgagaacaca 1980 
ccgcgcttca ctggctcgca gccctttggc 2040 
caggtatgca agccccgaaa cccctgcaca 2100 
aagtgcaact acctgggcca ctacagcgac 2160 
tacgccggca acggcatcat ctgcggggag 2220 
gacctgctgt gcgtggccaa cgcaacttac 2280 
cccaactcag ggcaggaaga ctatgacaag. 2340 
gatgacaatg ataagattcc agatgacagg 2400 
cagtacgact atgacagaga tgacgtggga 2460 
aacccagacc aggctgacac agataacaat 2520 
gatggggaca gtatcctcaa tgaacgggac 2580 
aaagacactg acatggacgg ggttggtgat 2640 
ccagaccagc tcgactctga ctcggaccgc 2700 
attgatgaag acggccacca gaacaatctg 2760 
caggctgacc atgacaagga tggcaaaggc 2820 
ggcattcctg atgaccggga caactgcagg 2880 
gatggtgatg gtcgaggtga tgcttgcaaa 2940 
attgatgaca tctgtcccga aaatgttgat 3000 
atgattcctc tagatcccaa agggacatcc 3060 
cagggtaaag aactcgtcca gactgtcaac 3120 
gaatttaacg ccgtggactt cagtggcacc 3180 
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ttcttcatca acaccgagag ggatgacgac 
agcagccgct tctatgttgt gatgtggaag 
cccacgaggg ctcaggggta ctctggactt 
cctggcgagc acctgcggaa tgccctgtgg 
acactgtggc atgaccctcg tcaeattggc 
ctgagccaca ggccaaagac aggtttcatc 
atggctgact caggacccat ctatgacaaa 
gtcttctctc aagaaatggt gttcttctcc 
tcatcaaact gttgatcaaa agactgatca 
accatgggct tagaaaaccc ccaggatcgc 
catgagtgtg gactcctaga acatgtgact 
gaccctgcat tcagcctctg actgagaaga 
ttggcttttg caaaagcaaa agcatccaca 
tgcttttgtc agagcagaat gcgactgtga 
ttttcaggca tgtgagagaa gggaggactc 
cctactccct ctggaatggg ggcgggtggg 
cccaagagat gattgtatga agaaaatatg 
attttcaggg gattgaaaga ctattgctgg 
taacccacat aaataggcac ttaaatagga 
gacttcctcc cagatttcca ccccttaaca 
ggagttaaac cgacacaagg cagggccagc 
gttgtattcc agatgtagct tctgcagatg 
agcgagcacc aggctgtctc ccaagggacg 
aggcacaaaa ttatcaacta agacattcct 
gttttccagt tgtctctttt ggactgtagt 
gtaaactatt tattttttac ttattctggg 



tatgccggct ttgtgtttgg ctaccagtcc 3240 
caagtcactc agtcctactg ggacaccaac 3300 
tccgtgaagg ttgtaaactc caccacgggg 3360 
cacacaggaa acacctctgg ccaggtgcgc 3420 
tggaaagatt tcactgccta cagatggcat 3480 
agagtggtaa tgtatgaagg gaagaaaatc 3540 
acctatgctg gtgggaggct aggcttgttc 3600 
gacctgaaat atgaatgcag agactcctaa 3660 
taaaccaatg ctggtattgc accttctgga 3720 
gcctcgctgc ctgcctttgc tctctgcttg 3780 
tgcctcaaga aaatgcaatt ttccaaatca 384 0 
atcttccaag gagacaaaca atgactttgg 3900 
tgctttggtt ggaaggtgcc tgtcccactc 3960 
ggccagctct gagcagtgga ctccaaaatg 4020 
actagaattg acaaacaaaa ccagccctga 4080 
ggggccaaag cccaaagggg aggatgcata 4140 
gaggaactgt tacatttttg gtactaaatc 4200 
atttcatgat gctgaccggt gttagctgat 4260 
gcagggaagg aaggaaaaga ctggcttctg 4320 
catcacctgt agtgaccaga acagggagtc 4380 
tgctgcagct tggttctatt gaaattgtca 4 440 
tagcagcaaa ataagaatac ccaccatctc 4500 
gcagccatgc ttgtattttt atggttagaa 4560 
tctttctctt tttttcctga acatcatgga 4 620 
ttttagtgtt ttaaacaaac actttacaat 4 680 
ggatctgtct gaaagactat tcatggaaca 474 0 
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ggaagaagcg taaggactat ccatatcatc tttgctacaa gtcattatga ctgtaagatt 4800 

gtaaatacag attatttatt aactctgttc tacctggaat ctagtttcat atggaaagtg 4860 

tttgagagca ggtagttgag atcgatcagc aaatctttca caggaatggc acaaggaaac 4920 

cagcatagca agctgctctt caccttgtgc ttagactgga tgatttggaa ttcttttttc 4 980 

cttttttttc ccaagtggaa ttacttggtt gtccatttgc aagtgttttt agtttgcaaa 5040 

gaaagccaag aggccattaa tactgtctta tcccatccct tgtgcctatt tccagggaga 5100 

tgaaaagcat ctacatttat tatttttgcc tttttccaaa agaaaaaaat gacaaaggtg 5160 

aaacttgtat acaaatatta cctcatttgt tgtgtgactg agtaaagaat tttgggatca 5220 

aacagaaaga gtttaagtgt ctaacaaact taaagctact gtagtaccta aaaaaaaaaa 5280 

aaaaaaaaa 5289 

<210> 63 
<211> 2053 
<212> DNA 
<213> Unknown 

<220> 

<223> Description of Unknown Organism: Unknown 
<400> 63 

gaattccggc ggccgctgag agcccaccct ggcgagctct cccagccgca gcctccgaat 60 
ccacggcctc caccccgcgc ctctccagcg ctctatcccg tcgctgcgcc cttgtcgccg 120 
gccccggcgc tgcatccgcg tccgcacagg ctccttgact gggcacaaat agctccacca 180 
tggggctggc ctggggactc ggtgtcctgc tcctgttgca tgcctgcggc tccaaccgca 240 
ttccagagtc tgggggagac aacagtgtgt ttgacatctt tgaactcacc ggagctgccc 300 
gcaacggtac tgggcgccga ctggtgaagg gccctgaccc ttctagccca gctttccgca 360 
tcgaggatgc caacctgatc ccccctgtgc ctgacaagaa gttccaagac ctagtggatg 420 
ctgtgcgggc ggagaaaggt ttcctcctcc tggcctccct gaggcaaatg aagaagaccc 480 
ggggtaccct gctggctgtg gagcggaaag accactctgg ccaggtcttc agcgtgatct 540 
ccaatggcaa ggcgggcacc ctggacctga gcctgaccgt gcaggggaag cagcatctgg 600 
tgtcggtgga agaagcactc ctggcgactg gccagtggaa gagcatcacc ctgtttgtgc 660 
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aggaggacag ggcccagctg tacatcgact 
tccccatcca gagcatcttc accagggacc 
aaggaggtgt caacgacaat ttccaggggg 
ccacaccaga agacatcctc aggaacaaag 
cccttgacaa caacgtggtg aatgggtcca 
acaagacaaa ggacctgcaa gccatctgtg 
tcctggagct caggggtcta cgcaccatcg 
tgaccgaaga gaacaaagag ctggccaacg 
acggagtgca gtacaggact ggcgacgagt 
gccagaactc agttaccatc tgcaaaaaag 
ccacagttcc ggatggagaa tgctgcccac 
gctggtcccc gtggtctgag tggacctctt 
agctggccgc tcctgcgaca gcctcaacaa 
gacctgccac atccaggagt gtgacaagag 
gtccccatgg tcatcttgct ccgtaacatg 
ctgcaactcc cccagccccc agatgaatgg 
caaagcctgc cagaaagact cctgccccat 
ggacatctgt tctgtcacct gtggaggagg 
ccccacaccc cagtttggag gcaaggactg 
caacaagcag gactgtccca ttgacggatg 
gtgtaccagc taccctgatg gcagctggaa 
agatggagtc gagtgcaaag acgttgatga 
ccacaatgga gagcacaggt gtgagaacac 
accgcccgga att 



-209- 

gtgagaagat ggagaatgcg gagctggatg 720 
tggccagcat cgccaggctc cgcattgcca 780 
tcctgcagaa tgtaaggttt gtctttggaa 840 
gctgctccag ctctaccagt gtctttgtca 900 
gccctgccat ccgcaccgac tacattggcc 960 
gcatctcatg tgacgagctg tccagcatgg 1020 
tgaccacgct gcaggacagt atccgcaaag 1080 
agctgaggag gcccccactc tgctaccaca 1140 
ggacggtgga cagctgcact gagtgtcgct 1200 
tgtcctgtcc catcatgccc tgctccaatg 1260 
ggtgctggcc cagcgactct gcagacgacg 1320 
gctctgtgac ctgtggcaat ggaatccagc 1380 
cagatgcgag ggctcctctg tgcagacgcg 144 0 
atttaaacag gatggcggct ggagccactg 1500 
tggagacggt gtgatcacaa ggatccggct 1560 
gaagccatgt gagggcaaag cccgggagac 1620 
caatggaggc tggggacctt ggtcaccatg 1680 
ggtacagaaa cgtagccggc tctgcaacaa 1740 
cattggtgat gtgacagaaa accagatctg 1800 
cctgtccaat ccctgctttg ctggtgtcca 1860 
gtgtggtgcc tgtcccccag gctatagtgg 1920 
gtgcaaagaa gtccctgatg cctgcttcaa 1980 
agaccccggc tacaactgcc tgccctgccc 2040 

2053 



<210> 64 
<211> 4339 
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<212> DNA 
<213> Unknown 

<220> 

<223> Description of Unknown Organism: Unknown 
<400> 64 

agccactgcc tggagtcagc cagcctcatc ggacttctgc aggcaatcgc gaagctgcta 60 
tccagttctg ccacggtctc tcccggcgca ccggcagtct cagcgtcttc accggactca 120 
gcgtccttgt ccttcacttc acctttgcca cctctccggg ttactgagcc ccggtgcaca 180 
caggctccgt gttgggcaca aaggctccac catggagctc ctgcggggac taggtgtcct 240 
gttcctgttg catatgtgtg gaagcaaccg cattccagag tctgggggag ataacggtgt 300 
gtttgacatc tttgaactca ttggaggtgc acgaaggggc cccggtcgcc gactggtgaa 360 
gggccaagat ctatccagcc ccgccttccg cattgagaat gccaacctga tccccgctgt 420 
gccggatgac aagttccaag acctactgga cgctgtgtgg gccgacaaag gcttcatctt 480 
cttggcttcc ttgaggcaga tgaagaagac ccggggcaca ctcctggctg tggaacggaa 540 
agacaacact ggccagatct tcagtgtggt ctccaacggc aaagctggca ccctggacct 600 
gagcctgagc ctgccaggga agcaacaagt ggtgtcagtg gaggaagctc tcctggccac 660 
tggccagtgg aagagcatca cgctgtttgt tcaagaggac cgggctcaac tctacataga 720 
ctgtgataag atggagagcg cggagctgga tgtacccatc cagagcatct tcaccaggga 780 
tctggccagc gttgccaggc tccgagttgc aaagggagat gtcaatgaca attttcaggg 840 
ggtgctgcag aatgtgaggt ttgtctttgg aaccacccca gaagacattc tcaggaacaa 900 
aggctgctcc agctctacca acgtccttct tacccttgac aacaacgtgg tgaacggttc 960 
cagccctgct atccgcacca actacatcgg ccacaaaaca aaggacctcc aagctatctg 1020 
tggcctctcc tgtgatgaac tatccagcat ggtcctggaa ctgaagggcc tgcgcaccat 1080 
cgtgaccact ctgcaggaca gcatccgaaa agtgacggaa gagaacagag agctggtcag 1140 
tgagctgaag cggcctcccc tctgctttca caatggagtc cagtacaaga acaacgagga 1200 
gtggactgta gacagttgca cagagtgtca ctgccagaac tcggttacca tctgcaaaaa 1260 
ggtgtcctgt cccatcatgc cctgctccaa cgccacagtt cctgatggtg aatgctgccc 1320 
acggtgctgg cccagcgact ctgctgacga tggctggtct ccctggtctg agtggacctc 1380 
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ctgctctgcc acatgtggca atggaattca 
caacagatgc gagggctctt cggtacagac 
aagatttaaa caggatggtg gctggagtca 
ctgtggtgac ggtgtgatca caaggatccg 
cgggaagccc tgtgaaggtg aagcccggga 
aattaatgga ggctggggtc cctggtcacc 
aggagtgcag agacgcagcc gactctgtaa 
ctgtgttggc gatgtgacag aaaatcaagt 
atgcctgtcc aatccctgct ttgctggtgc 
gaaatgtggt gcgtgtcctc ctggctacag 
tgagtgcaaa gaagtgcctg atgcttgctt 
cacagatcct ggctacaact gcctgccctg 
cggccgaggt gtcgaacatg ccatggccaa 
cacggacggg acgcatgact gcaacaagaa 
cgaccccatg taccgctgtg agtgcaagcc 
agaggacaca gacctggacg gctggcctaa 
ctaccactgc aaaaaggaca actgccccaa 
caaggacggg attggcgatg cctgcgatga 
cagggacaac tgtccattcc attacaaccc 
gggagaccgc tgtgacaact gcccctacaa 
aaacggggag ggcgatgcct gtgctgtgga 
agacaactgc cagtacgttt acaacgtgga 
agatcagtgt gacaactgcc ccctggaaca 
cctcataggg gacacttgtg acaacaatca 
cctggacaac tgtccctatg tgcctaacgc 
aggagatgcc tgtgaccatg acgatgacaa 
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gcaacgtggt cgttcctgtg acagcctcaa 1440 
gaggacctgc cacattcagg agtgtgacaa 1500 
ctggtctcca tggtcgtcct gttctgtgac 1560 
tctctgcaac tcccccagcc cccagatgaa 1620 
gaccaaagcc tgcaagaaag acgcctgccc 1680 
atgggacatc tgctctgtca cctgtggagg 1740 
caaccccaca ccccagtttg gaggcaaaga 1800 
ttgcaacaag caggactgcc caattgatgg 1860 
caagtgtact agctaccctg atggtagctg 1920 
tggaaatggc atccagtgca aagacgtcga 1980 
caatcacaac ggagaacatc ggtgcaagaa 2040 
cccaccacga ttcactggct cacagccctt 2100 
caaacaggtg tgcaaaccgc gaaacccctg 2160 
cgctaagtgc aactacctgg gtcactacag 2220 
cggctatgca ggcaatggca tcatctgcgg 2280 
tgaaaacctg gtgtgtgtgg ccaacgcaac 2340 
ccttcccaac tcggggcagg aagactatga 2400 
tgacgatgac aacgacaaga tccctgatga 24 60 
agcccagtat gactatgaca gagatgatgt 2520 
ccacaaccct gaccaagcag acacagacaa 2580 
catcgatgga gatggaatcc tcaatgaacg 2640 
ccagagggac acggacatgg atggggttgg 2700 
caatccagac cagctggact ctgactcaga 2760 
ggacatcgat gaggatggcc atcagaacaa 2820 
caaccaggcc gaccatgata aagatggcaa 2880 
tgacggcatc cctgatgaca gagacaactg 2940 
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caggctggtg cccaatcctg accagaagga 
caaagacgac tttgaccatg acaatgtgcc 
tgacatcagt gaaaccgatt tccgacgatt 
ctcccaaaat gaccctaact gggttgtccg 
aaactgtgac cctggacttg ctgtaggtta 
taccttcttc atcaacaccg agagagatga 
gtccagcagc cgcttctacg ttgtgatgtg 
caaccccaca agggctcagg gatactcagg 
cggccctggc gagcacctgc ggaatgcact 
gcgcaccctg tggcatgacc ctcgccacat 
gcgtctcagc cacaggccaa agaccggtta 
aatcatggct gactcgggac ccatctatga 
gttcgtcttc tctcaggaaa tggtgttctt 
ctaatcatca gctgccaatc ataaccagcg 
agagaaaccc cccacccctg ccgggatcgc 
catagtgtgg acttgtaaag cctgagacct 
agtcagcact cggcctttaa cgaatgagaa 
gggtttgctt ttgggaaagc caaagcgcct 
cactctgcct tactcacaga gccagaactt 
gaagcatttt caggcatgtc aaagaaagga 
aacctctgac ataggtcctg agatgtgggg 
gtacccaaga gatgactgta tgaagaaaat 
tcaggggaca gacagacttg ctgcatttcc 
atcttccaca caggcactt 



-212- 

ctctgatggt gatggccgag gtgacgcctg 3000 
agatattgat gacatctgtc ctgagaattt 3060 
ccagatgatt cctctagatc ccaaaggaac 3120 
ccatcagggc aaagaactcg tccagactgt 3180 
tgatgagttt aatgctgtgg acttcagcgg 3240 
tgactacgct ggcttggtat tcggctacca 3300 
gaaacaagtc acccagtcct actgggacac 3360 
cctgtctgta aaggttgtga actccaccac 3420 
gtggcacaca ggaaacaccc ctggccaggt 3480 
cggctggaaa gatttcactg cgtacagatg 3540 
tatcagagtg gtgatgtatg aaggaaagaa 3600 
caaaacctac gccggcggta gactaggcct 3660 
ctcagacatg aaatacgagt gtcgagattc 3720 
ctggcaatgc accttctaaa aacaagggct 3780 
ctttcctcgc cttccttgcc tctcttcttg 3840 
gcctcaagaa aatgcagttt tcgaacccag 3900 
tgcatcttcc aagaccatga agagttcctt 3960 
atttacttcc cactaggaag gtgcccgctc 4020 
cttcgaggcc acctctgagc agcacacaca 4080 
aaaatgactc actagaactc accgccaaac 4140 
aggcaggagc caaagctcta gggagggcat 4200 
gtggaggagc tgttcggtac taaatcattt 4260 
gcatgctgct ggtgagagct gattgaccca 4320 

4339 



<210> 65 
<211> 186 



WO 99/37660 PCT/US99/01313 

-213- 

<212> DNA 
<213> Unknown 

<220> 

<223> Description of Unknown Organism: Unknown 
<400> 65 

gcacagttaa tggaggctgg ggtccctggt caccatggga catctgctct gtcacctgtg 60 
gaggaggagt gcagagacgc agccgactct gtaacaaccc cacaccccag tttggaggca 120 
aagactgtgt tggcgatgtg acagaaaatc aagtttgcaa caagcaggac tgcccaattg 180 
gtaagc 186 

<210> 66 
<211> 5774 
<212> DNA 
<213> Unknown 

<220> 

<223> Description of Unknown Organism: Unknown 
<400> 66 

gtcactttgg ttgatagcag ccgctctggt agaggttagg acttcagctg atggacaagc 60 
tggtaatgaa gaaatggtgc aaatagattt accaataaag agatatagag agtatgagct 120 
ggtgactcca gtcagcacaa atctagaagg acgctatctc tcccatactc tttctgcgag 180 
tcacaaaaag aggtcagcga gggacgtgtc ttccaaccct gagcagttgt tctttaacat 24 0 
cacggcattt ggaaaagatt ttcatctgcg actaaagccc aacactcaac tagtagctcc 300 
tggggctgtt gtggagtggc atgagacatc tctggtgcct gggaatataa ccgatcccat 360 
taacaaccat caaccaggaa gtgctacgta tagaatccgg aaaacagagc ctttgcagac 420 
taactgtgct tatgttggtg acatcgtgga cattccagga acctctgttg ccatcagcaa 480 
ctgtgatggt ctggctggaa tgataaaaag tgataatgaa gagtatttca ttgaaccctt 540 
ggaaagaggt aaacagatgg aggaagaaaa aggaaggatt catgttgtct acaagagatc 600 
agctgtagaa caggctccca tagacatgtc caaagacttc cactacagag agtcggacct 660 
ggaaggcctt gatgatctag gtactgttta tggcaacatc caccagcagc tgaatgaaac 720 
aatgagacgc cgcagacacg cgggagaaaa cgattacaat atcgaggtac tgctgggagt 780 
ggatgactct gtggtccgtt tccatggcaa agagcacgtc caaaactacc tcctgaccct 840 
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aatgaacatt gtgaatgaaa tttaccatga 
cctggtgcgc atgataatgc tgggatatgc 
cccatccaga agcttggaga atgtgtgtcg 
caaccactct gaacaccatg accatgcaat 
tggaatgcaa ggatatgctc cagtcaccgg 
gaatcatgag gatggttttt catctgcttt 
gggaatggag catgatggac aaggcaacag 
catggctccc ttggtacaag cagcattcca 
agaactgaaa agatatatcc attcctatga 
ttggcctaaa ctcccagaac ttcctggaat 
tgattttggt gttggctata aaatgtgcac 
gctgtggtgt agccatcctg ataatcccta 
tgatgggact gaatgtgctg ctggaaaatg 
tgctaatcag caaaaacaag atggcaattg 
tcggacatgt ggaactggtg ttcgtttcag 
caatggtggt caggattgtc ctggtgttaa 
atgccaaaaa cactttgagg acttcagagc 
tgaataccag aataccaaac accactggtt 
atgccacctt tactgtcagt ccaaggagac 
gcatgatgga acgcactgtt cttacaaaga 
tgtgaaagtg ggctgtgata aagaaattgg 
ctgtggagga gataattccc actgccgaac 
gaagcttggg taccttaaga tgtttgatat 
agaagacgag gcttctcctc atattcttgc 
tttaaatggc aaaggggagg aagccaagtc 
ggattataac attgaagatg acattgaaag 
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tgagtccctc ggagtgcata taaatgtggt 900 
aaagtccatc agcctcatag aaaggggaaa 960 
ctgggcgtcc caacagcaaa gatctgatct 1020 
ttttttaacc aggcaagact ttggacctgc 1080 
catgtgtcat ccagtgagaa gttgtaccct 1140 
tgtagtagcc catgaaacgg gccatgtgtt 1200 ' 
gtgtggtgat gagactgcta tgggaagtgt 1260 
tcgttaccac tggtcccgat gcagtggtca 1320 
ctgtctcctt gatgaccctt ttgatcatga 1380 
caattattct atggatgagc aatgtcgttt 144 0 
cgcgttccga acctttgacc catgtaaaca 1500 
cttttgtaag actaaaaagg gacctccact 1560 
gtgctataag ggtcattgca tgtggaagaa 1620 
ggggtcatgg actaaatttg gctcctgttc 1680 
aacacgccag tgcaataatc ccatgcccat 1740 
ttttgagtac cagctttgta acacagaaga 1800 
acagcagtgt cagcagcgaa actcccactt 1860 
gccatatgaa catcctgacc ccaagaaaag 1920 
tggagatgtt gcttacatga aacaactggt 1980 
tccatatagc atatgtgtgc gaggagagtg 2040 
ttctaataag gttgaggata agtgtggtgt 2100 
cgtgaagggg acatttacca gaactcccag 2160 
accccctggg gctagacatg tgttaatcca 2220 
tattaagaac caggctacag gccattatat 2280 
gcggaccttc atagatcttg gtgtggagtg 2340 
tcttcacacc gatggacctt tacatgatcc 2400 
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tgttattgtt ttgattatac ctcaagaaaa 
catcatccat gaagactctg tacctacaat 
agatactttt gagtgggctt tgaagagctg 
tttccagtac actaaatatg gatgccgtag 
cttctgtgag gccaacaaaa agccgaaacc 
tacacatcca ctctgggtag cagaagaatg 
tggctatcag cttcgcactg tacgctgcct 
tgtgcacagc aaatactgca tgggtgaccg 
gccctgccct gcacagtgga aaacaggacc 
aggaacggag gtgaggcagg tcctctgcag 
tgagtcggtc agagcctgtc aactgcctcc 
gtccatattc tgtcaaatgg aagtgttggc 
gttatgttgt gagtcctgca gcaagcgcag 
agctgctgaa actcatgatg atgtcatctc 
gatgcctaca tctttggttc cttatcattc 
tagcatctct tcagtgggag gtccaaatgc 
tgatggtgct aatttacgcc agaggagtgc 
ggtcaccgta ccatcctccc cacccaccaa 
ggctgctgct tccttctttg cagccagtga 
ctcaaagaaa gatggaaaga tcattgacaa 
aagatgagaa agtgaaccaa aaaggctaga 
ttcccatggt gcatatgctt gtttaaagtg 
tctgtaattg gaagaacaga aagtgctggc 
gttctgcatt gactcattta ccagaattca 
agaaaaatat gttgctaaga ttgtgttggt 
caattgtgca tatcagctga ctttttgttt 



tgatacccgc tctagcctga catataagta 24 60 
caacagcaac aatgtcatcc aggaagaatt 2520 
gtctcaggtt tccaaaccct gtggtggagg 2580 
gaaaagtgat aataaaatgg tccatcgcag 2640 
tattagacga atgtgcaata ttcaagagtg 2700 
ggaacactgc accaaaacct gtggaagttc 2760 
tcagccactc cttgatggca ccaaccgctc 2820 
tcccgagagc cgccggccct gtaacagagt 2880 
ctggagtgag tgttcagtga cctgcggtga 2940 
ggctggggac cactgtgatg gtgaaaagcc 3000 
ttgtaatgat gaaccatgtt tgggagacaa 3060 
acgatactgc tccataccag gttataacaa 3120 
tagcaccctg ccaccaccat accttctaga 3180 
taaccctagt gacctcccta gatctctagt 3240 
agagacccct gcaaagaaga tgtctttgag 3300 
atatgctgct ttcaggccaa acagtaaacc 3360 
tcagcaagca ggaagtaaga ctgtgagact 3420 
gagggtccac ctcagttcag cttcacaaat 3480 
ttcaataggt gcttcttctc aggcaagaac 3540 
cagacgtccg acaagatcat ccaccttaga 3600 
aaccagagga aaacctggac aacctctctc 3660 
gaaatctcta tagatcgtca gctcatttta 3720 
tcactttcta gttgctttca tcctcctttt 3780 
ttggaagaaa tcaccaaaga ttattacaaa 3840 
cgctctctga agcagaaaag ggactggaac 3900 
gttttagaaa agttacagta aaaattaaaa 3960 
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agagatacca atggtttaca ctttaacaag 
gacttgtatt cctatttatc tatattagaa 
gtaaatactg tatattgcaa aaatcagtat 
tttactatat tacatttctg gatgttctag 
cattctatag gttaattttc aaagcagagt 
ctgacaatat aaagggtttt gttgaatcaa 
aagaaacaca aaagctatag atatacagat 
aatttaaagg attggtgtct tagtacactt 
atgaactcgt gcaagacaaa actgaaaccc 
ggtgtccttt gtttttgtgt gtgtgtgttc 
aaacagctat actcaatgta ctgtaataat 
ttgtatgttg gtagctgaga aaaatatcat 
agctttgggg ctttgaaggc aggttcaaga 
ttccatatgg ttcatgttca aatgttcaca 
taataattta tgtcagtagt caccttgctc 
agtagatgta aagtacttgt acatagaatt 
ttttttttct tgatagtatt tttatgtact 
ttggtaaact agagagacat aattagagat 
aagcaaacta ctacagccaa ctcaaaagct 
atcttgcatt tttagtagtt gatattaagt 
aaattgtatg gactcagcta gctgttcaat 
tttttgaaag agataagtgc atcatgaatt 
gcataatgat tttgaggtca gtacctgagc 
gtagatgaat taacctctca aagcacagaa 
taaactaaca gcttaaacta ttgactctat 
agaactttga aggtcaaatt aggcaaattc 



aaattttgga tatggaacaa agaattctta 4020 
atattgtatg agcaaatttg cagctgttgt 4080 
tattttaaga gatgtgttct caaatgattg 4140 
gtgcctgtcg ttgagtattg ccttgtttga 4200 
attacaaaag agaagttaga attacagcta 4260 
caatgtgata cgtaaattat agaaaaagaa 4320 
atcagcttac ctattgcctt ctatacttat 4380 
gtggtcacag ggatcaacga atagtaaata 4440 
tctttccagg acctcagtag gcaccgttga 4 500 
ttttttaatt ttcgcattgt tgacagatac 4560 
cgcaaaggaa aaagttttgg gataacttat 4 620 
cagtctagaa ttgatatttg agtatagtag 4 680 
aagcatatgt cgatggttga gatatttatt 4740 
accacaatgc atctgactgc aataatgtgc 4 800 
acagcaaagc cagaaatgct ctctccaggg 4860 
cagaactgaa gatatttatt aaaagttgat 4 920 
aaatatttac actaatatca attacatatt 4 980 
gcatgctttg ttctgtgcat agagaccttt 5040 
aaaactgaac aaatttgatg ttatgcaaac 5100 
tgatgacttg tttcccttca aggaaacatt 5160 
gaaattgtga attagaaaca tttttaaaag 5220 
acatgtacat gagaggagat agtgatatca 5280 
tgtctaaaaa tatattatac aaactaaaat 5340 
tgtgcaagaa cttttgcatt ttaatcgttg 5400 
acctctaaag aattgctgct actttgtgca 54 60 
cagatagtaa aacaatccct aagccttaag 5520 
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tctttttttt ttcctaaaaa ttcccataga ataaaattct ctctagttta cttgtgtgtg 5580 

catacatctc atccacaggg gaagataaag atggtcacac aaacagtttc cataaagatg 5640 

tacatattca ttatacttct gacctttggg ctttcttttc tactaagcta aaaattcctt 5700 

tttatcaaag tgtacactac tgatgctgtt tgttgtactg agagcacgta ccaataaaaa 5760 
tgttaacaaa atat 5774 

<210> 67 
<211> 5535 
<212> DNA 
<213> Unknown 

<220> 

<223> Description of Unknown Organism: Unknown 
<400> 67 

ggactttaga agccgttgct gccctctctg tcacctgaag cggggccctc tcccatccca 60 

cccttgcccc gcctccctgc ccccaccggg ccggccctgc ccgccgccgg accctggcat 120 

gtcaagacct ggtccgcgcc tgcctgccca gcccgcggaa ccccggcggc cccgcgagct 180 

aggatgaggg gccaggccgc cgccccgggc cccgtctgga tcctcgcccc gctgctactg 240 

ctgctgctgc tgctgggacg ccgcgcgcgg gcggccgccg gagcagacgc ggggcccggg 300 

cccgagccgt gcgccacgct ggtgcaggga aagttcttcg gctacttctc cgcggccgcc 360 

gtgttcccgg ccaacgcctc gcgctgctcc tggacgctac gcaacccgga cccgcggcgc 420 

tacactctct acatgaaggt ggccaaggcg cccgtgccct gcagcggccc cggccgcgtg 480 

cgcacctacc agttcgactc cttcctcgag tccacgcgca cctacctggg cgtggagagc 540 

ttcgacgagg tgctgcggct ctgcgacccc tccgcacccc tggccttcct gcaggccagc 600 

aagcagttcc tgcagatgcg gcgccagcag ccgccccagc acgacgggct ccggccccgg 660 

gccgggccgc cgggccccac cgacgacttc tccgtggagt acctggtggt ggggaaccgc 720 

aaccccagcc gtgccgcctg ccagatgctg tgccgctggc tggacgcgtg tctggccggt 780 

agtcgcagct cgcacccctg cgggatcatg cagaccccct gcgcctgcct gggcggcgag 840 

gcgggcggcc ctgccgcggg acccctggcc ccccgcgggg atgtctgctt gagagatgcg 900 

gtggctggtg gccctgaaaa ctgcctcacc agcctgaccc aggaccgggg cgggcacggc 960 
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gccacaggcg gctggaagct gtggtccctg 
ggcctccaga cgcggacgcg cacctgcctg 
gagggggtgc tggaggaggg tcgccagtgc 
accagctccc ggagccagtc cctgcggtcc 
gacgagctgc agcagtttgg gttcccagcc 
tggtccccgt ggagcgtgtg ctccagcacc 
ttctgcgtgt cctcctccta cagcacgcag 
tgcaacaact ctgccgtgtg cccagtgcat 
ctctgctcca gcacctgtgg ccgtggcttt 
cagtttgggg gcaacccctg tgagggccct 
ctgtgccctg gccgggcagt ggatggaaac 
tccgccagct gctcccaggg ccgacagcag 
gggggtgcgg agtgccaggg ccactgggtg 
ccagtggatg gcaagtggca ggcctgggcg 
gctggcagcc agcgacggga gcgtgtctgc 
cagggccccc aggatgagta ccggcagtgc 
atctgtgatg aggacaactt tggtgctgtg 
gctgctgtcc ggtgtccccg caacgccaca 
gaggaaggca tcgcctactg ggagcccccc 
agaaacatcc agatgatgac ccgggagcac 
gagggggtct cggaggtcat ccagacactg 
agtggggacc tgctgtccac catcgatgtc 
gcgtactaca gccccacccc tggggacgta 
ttggcagagg agaatcggga caagtgggag 
gagctgttcc ggctggtgga ggactttgtg 
agggatgcat accaggtgac agacaacctg 



tggggcgaat gcacgcggga ctgcggggga 1020 
cccgcgccgg gcgtggaggg cggcggctgc 1080 
aaccgcgagg cctgcggccc cgctgggcgc 1140 
acagatgccc ggcggcgcga ggagctgggg 1200 
ccccagaccg gtgacccagc agccgaggag 1260 
tgcggcgagg gctggcagac ccgcacgcgc 1320 
tgcagcggac ccctgcgcga gcagcggctg 1380 
ggtgcctggg atgagtggtc gccctggagc 14 40 
cgggatcgca cgcgcacctg caggcccccc 1500 
gagaagcaaa ccaagttctg caacattgcc 1560 
tggaatgagt ggtcgagctg gagcgcctgc 1620 
cgcacgcgtg aatgcaacgg gccttcctac 1680 
gagacccgag actgcttcct gcagcagtgc 1740 
tcatggggca gttgcagcgt cacgtgtggg 1800 
tctgggccct tcttcggggg agcagcctgc 1860 
ggcacccagc ggtgtcccga gccccatgag 1920 
atctggaagg agaccccagc gggagaggtg 1980 
ggactcatcc tgcgacggtg tgagctggac 2040 
acctacatcc gctgtgtttc cattgactac 2100 
ctggccaagg ctcagcgagg gctgcctggg 2160 
gtggagatct ctcaggacgg gaccagctac 2220 
ctgaggaaca tgacagagat tttccggaga 2280 
cagaactttg tccagatcct tagcaacctg 234 0 
gaggcccagc tggcgggccc caacgccaag 2400 
gacgtcatcg gcttccgcat gaaggacctg 2460 
gttctcagca tccataagct cccagccagc 2520 
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ggagccactg acatcagctt ccccatgaag 
gtgccagagg acagggtcac tgtgtccaag 
gatgaagcat ccgtgtttgt ggtgggcacc 
gccctgcaga ggaacacgac cgtcctgaat 
ccgcctcgct ccctgcgcac acccttggag 
accaaccaga cctgtatcct gtgggatgag 
cagctcgggc cctggtcgtg gcgcggctgc 
cgctgcctct gtgaccggct ctccaccttc 
aacatggaga aggcgactct gccgtcggtg 
ctcaccctgc tcatgctggt catcatctac 
cgttctgtca tcctcatcaa cttctgcctg 
atcgggcaga cccagacccg caacaaggtg 
ttcttcttcc tgtcctcctt ctgctgggtg 
gtgacgggcc acctccggaa ccgcctcatc 
ctccctgcac tggttgtggc catttctgtg 
atgaactact gctggctctc cctggagggg 
gctgccgttg tgctggtgaa catggtcatt 
aaagacggca tcacggacaa gaagctgaag 
tgcgtggtgc tgccgctgct ggcgctgacc 
cgccgctccg ccctcttcca gatcctcttc 
atcgtcatgg tgcactgtat cctccgtaga 
gttgaccggc aggaggaggg caacggggac 
cagctcatga ccgacttcga gaaggacgtg 
gacatcgcgg cctgccgcac tgccaccatc 
gaggaggaga agctgaagct ggcccatgcc 
ccggccaacg tgtccaagct gcacctgcac 



ggctggcggg ccacgggtga ctgggccaag 2580 
agtgtcttct ccacggggct gacagaggcc 2640 
gtgctctaca ggaacctggg cagcttcctg 2700 
tctaaggtga tctccgtgac tgtgaaaccc 2760 
atcgagtttg cccacatgta taatggcacc 2820 
acggatgtac cctcctcctc cgcccccccg 2880 
cgcacggtgc ccctcgacgc cctccggacg 2940 
gccatcttag cccagctcag cgccgacgcg 3000 
acgctcatcg tgggctgtgg cgtgtcctct 3060 
gtgtccgtgt ggaggtacat tcgctcagag 3120 
tccatcatct cctccaatgc cctcatcctc 3180 
atgtgcacgc tggtggccgc cttcctgcac 3240 
ctcaccgagg cctggcagtc ctacatggcc 3300 
cgcaagcgct tcctctgcct gggctggggg 3360 
ggattcacca aggccaaagg gtacagcacc 3420 
ggactgctct atgccttcgt gggacctgcc 3480 
gggatcctgg tgttcaacaa gctcgtgtcc 3540 
gagcgggcag gggcctccct gtggagctcc 3600 
tggatgtcgg ctgtgctcgc cgtcaccgac 3660 
gctgtcttcg actcgctgga gggcttcgtc 3720 
gaggtccagg acgctgtgaa atgccgtgtg 3780 
tcagggggct ccttccagaa cggccacgcc 3840 
gatctggcct gtagatcagt gctgaacaag 3900 
acgggcacac tgaagcggcc gtctctgccc 3960 
aaggggccgc ccaccaattt caacagcctg 4020 
ggctcacccc gctatcccgg cgggcccctg 4080 
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cccgacttcc ccaaccactc actgaccctc aagagggaca aggcgcccaa gtcctccttc 4140 
gtcggtgacg gggacatctt caagaagctg gactcggagc tgagccgggc ccaggagaag 4200 
gctctggaca cgagctacgt gatcctgccc acggccacgg ccacgctgcg gcccaagccc 4260 
aaggaggagc ccaagtacag catccacatt gaccagatgc cgcagacccg cctcatccac 4 320 
ctcagcacgg cccccgaggc cagcctcccc gcccgcagcc cgccctcccg ccagcccccc 4380 
agcggcgggc cccccgaggc accccctgcc cagcccccac cgcctccgcc cccaccgcca 4440 
ccacctcccc agcagcccct gcccccaccg cccaatctgg agccggcacc ccccagcctg 4500 
ggggatcccg gggagcctgc cgcccatccg ggacccagca cggggcccag caccaagaac 4560 
gagaatgtcg ccaccttgtc tgtgagctcc ctggagcggc ggaagtcgcg gtatgcagaa 4 620 
ctggactttg agaagatcat gcacacccgg aagcggcacc aagacatgtt ccaggacctg 4 680 
aaccggaagc tgcagcacgc agcggagaag gacaaggagg tgctggggcc ggacagcaag 4740 
ccggaaaagc agcagacgcc caacaagagg ccctgggaga gcctccggaa agcccacggg 4 800 
acgcccacgt gggtgaagaa ggagctggag ccgctgcagc cgtcgccgct ggagcttcgc 4860 
agcgtggagt gggagaggtc gggcgccacg atcccgctgg tgggccagga catcatcgac 4 920 
ctccagaccg aggtctgagc gggtgggcgg cggccacgca ctgggccacg gaggagggat 4 980 
gctgctccgc ccgctcctgc cgcagacggg cacagacacg ctcgcgggca gcgggccagg 5040 
cccgcacccc ggcctcaggg cgctcagacg gcggccaggc acagggcccg cagtgctggg 5100 
accagagcca gatgcaggac aggaggcggc ccggccagcg ggcacagggc accagaggcc 5160 
gaaggtgcct cagactccgc cctcctcggg ccgaggccca gcgggcagat gggcggacgg 5220 
ctgtggaccg tggacaggcc cagcgcggcc agcgtcccag ggtacccgcc tgagctcctg 5280 
ctgcggagga gctgcctgct tggcccggcc ggcctggcac cgttttttaa acacccccat 5340 
ccctcgggaa gcagccagct ccccacacct tccagggccc taggcccctc ctagacccag 54 00 
gtggagggca cagccctccg accctcatgg cccccagggg caggactgag tcccctccag 54 60 
gaagaagcag gggggaatct attttttctc tccttttctt ttcttcaata aaaagaatta 5520 
aaaacccaaa aaaaa 5535 
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<210> 68 
<211> 398 
<212> DNA 
<213> Unknown 

<220> 

<223> Description of Unknown Organism: Unknown 
<400> 68 

cggggcaacc cgctggagtg gacgggccag gtgacggtgc gcaagaagcg caagccctac 60 
tccaagttcc agacgctcga gctcgagaag gagttcctct tcaacgcgta cgtcagcaag 120 
cagaagcgct gggagctggc gcgcaacctc aacctcaccg agcgccaggt caagatctgg 180 
ttccagaacc ggcgcatgaa gaacaagaag aacagccagc gccaggcggc cagcagcagc 240 
agcagcaaca gcagcagcag cagcagcagc aacagcagca agcggccgcc ggcggggcgt 300 
cggccgccgc caacggccac cagggccacc aagcgcacca ccacgcgccc cccaacggcg 360 
ccgtcgcagc cctcaagcac caccagtgac ccgtagcg 398 

<210> 69 
<211> 8670 
<212> DNA 
<213> Unknown 

<220> 

<223> Description of Unknown Organism: Unknown 
<400> 69 

cccgggtgcg gtgtcgtgtg tggggctggg cgccatgttc ctggacatgc tgagggccaa 60 
gcgcgacacg gcgcccgacc gccgccagct ggacgaccgg atgatggggg cggacccggg 120 
ggacatagcg gccaaggtga gggcagggtt ttgcgtgcgt gcttgattgt gcgtgtgcgt 180 
gcgtgcgtgc gtgcgtgcgg tgttgcgtgt gtatttgaac tgtgttttgt gtatgtactt 240 
aggggtaaga gtgcatacac atgcatgcga ccggtggcct tacaaatcaa caacacgtac 300 
gcctgcatgt atccaggtgg cagcgtggcg acgagcacgt ggcttcgagg gcccaggcac 360 
ggcgggcccc agcggcagcg ccgccagtgg cagcggcgcc agcggctcgg caccgcaggc 420 
gcgctcgccc cgacctcagc caccgcggcc gcgctcacct tcacgcgggt gaaccccggc 480 
gaggagccgc ccgtgtacgc gtgcgagcaa acaggtgcgt aagcgacgtg tgggcagcgc 540 
gaagaggcgt gggggcgaga gagcaaaggg actagggaaa cgcacagcca aatacggtat 600 
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gcgggcaacg aggcgatggc cctggaaatc 
aattgctggg cgactaccgt agtctactga 
actgcagtag tgccgttgcc agccgcgctg 
ggcccatgcc tcgcgcactc cgagcaccca 
ccgccgccct ctcccccgcc ttcacttctt 
ccgcttcctc aggctgggtg cacgtgtgcg 
ccgcagcgaa ctgctggtgt gcccggtgag 
agcgcgtgtt tctcagggcc tgtgtgggac 
gaactgtgta tgtggagtag cgtggcgtgt 
tgccccacgt tgcagagtca gctgacaacg 
gcacgcacgt cctccgcact cccaacaaat 
ctatgcccgc acacgctccc gcgccctcag 
cacgagtgcg aggaggatga ggcgggcggc 
ggaggacggc ctgggactgg gcatcacagg 
gggccttgta cgtgactgcg gttctgcatg 
gtacggcttg tgggcatgca gtgccttgac 
ttgccccgct ccctgcacac actgcagccg 
agtgcgagaa cgcgcagcag ctcaacaggc 
ccgcaggggg agtgtgttca tattgtggtt 
gtattgcacg cgtacagcat tggtgactgg 
aggcgctgca gcgagttgtg gggcatgcgt 
gttgagttgg cgtttggagg tgtgggacga 
agaaggcgtg accgttttac gagctcgtat 
atttttgtta caagcgacga atgcgtgggg 
ttgcgcatgc acacgtgacc aaatttatgc 
ttgtgtatcg gttggcactt ataacaattg 



gcagggccct tttgaaatcg tgtaaggcgc 660 
tgcattgcac tacttgtatt actgtatcct 720 
ctgccctttg gctcccttcc caatccaaat 780 
gagcacccag aagccgttgc gtgcgctccg 840 
aattaatcgt gaatgtaatc cccccccccc 900 
cgacgcctgc acggagggtg tggtggatgc 960 
tcgacgagga ggaggtgcaa gggggatacc 1020 
accgaaacgt ggtaaaagag acccgcccgc 1080 
gcggccggac cgacaaggca gcttgtggac 114 0 
acacgtgcgc cttcctgtca ttgcccgtgc 1200 
tgacagcgac acgtgcgcct tcctataagc 1260 
gtgtcgggcc agaccacaga ccggttggtc 1320 
tgcggcggcg ccggcggggc gccgcggcga 1380 
tgggtggcag gctggcaggg actcacgcat 1440 
gctagtggct cacgcgctgc gcacgttcac 1500 
gtgaggctgc gctgccttgc tgctgccgcc 1560 
gcttcgggcg ctacttcacc gcgggctacg 1620 
tgctggggta caaggcgctg tgagagcgcg 1680 
gtttgggccg tgggcgcggg ctgcatgtgc 1740 
tcaggtgtaa gcggccggca gtgcgccgcg 1800 
catgcgcaga cggcccctgg acgacaaggc 1860 
cgtggggttt gtgccgtcaa agcacagaac 1920 
gatgtagcat ggattgaata atgacatgtg 1980 
ttttggatgg caggggtttc agtcgcccga 2040 
tcaacgacgt gaccattgct ttatacatac 2100 
gctcgtcaaa ttgacgcgag gctgcacttc 2160 
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gatcctgaaa gccccagttc aacaagtcgg atagccaaat ggccccgctc gctctccagc 2220 
atcaaggggc ctctaagtgc ctcgcggcaa cccagcgcaa gtgtgctcgc gttgcggtga 2280 
gctggactcg tgcacttgtc gacgccgtcg gcaccgcaat cgaaagacgc gtgcgtcgag 2340 
caattgtgga agccgctgac gaattgtccg catgtgacat tgcaggctcg cgtccccgct 2400 
cgtctcagcg tcatggccca ggtgcggacg ttgggactgc acttgcacga atgtgatggg 24 60 
gccgcaccga gtctgcgcgg acgtctcgct gacgtttcgc gttgaatgca tctcgcaata 2520 
ggcagctgct gcgcctgctg acaacactaa gaagctgtgg ggcggtcgct tcacgggcaa 2580 
gacggacccg ctcatggaga agttcaacga gtcgctgccc tttgacaagc gcctgtgggc 2640 
tgaggacatc aaggtgcggc acagggaggg gggcgagtgg tggggtgggg ctggggggga 2700 
cgcgggtttg gtggccaggg cagggaggga agacgtgcgg ggctaggcaa gaggctgcga 2760 
gggcccaggg taacaccaga ccgtgccgtg tcgcgtgccc ggcttgctgc ccaccttgcc 2820 
cggccatccc caccgccctc cccaccagca atgacacgta cacattcaca cactccccca 2880 
cacccacata cccacacacc cacgcattcc ccaacagggc agccaggcgt acgccaaggc 294 0 
tcttgccaag gccggcattc tggcacatga cgaggccgtg accattgtgg aggggctggc 3000 
caaggtgcgc acacccggca gcagggcggg tgggtgggtg ggtggggtgg gggggcagag 3060 
agaggcgcgg gctgagaggg ggctgagagg ggggtcagcg aggcgcaggc tcagggggag 3120 
gcgtctgagg ggggctgaga tggtggtggg ggagctgcgg gtgctggggc tgctgcggtg 3180 
gcgggcgggc gggcgggcgg gcgacgtgta cgtgagtagc cgctgaccgg gcgctgggcc 3240 
tttgcgcacg ccacagccca catgacaccg ccgcaaggcc cgccgcgccc cacccacgtt 3300 
cacacactcc ccacacccac gcgtgcgcgc gcctccttcc cctcaataca cgcgcctcct. 3360 
tcccctggcc cccgcctgct ccccccatcc ggccgccccg cctgcaggtg gctgaggagt 3420 
ggaaggcggg tgcctttgtg atcaaggcgg gtgacgagga catccacacg gccaacgagc 34 80 
ggcgcctcac ggagctggtg ggggcggtgg gcggcaagct gcacaccggc cgctcgcgca 3540 
acgaccaggt gagggtgggt gggtgggggt ggggtgggtg ggtgggtggg tgggtgggtg 3600 
ggtgggtggg tgggtgggtg ggtgggtggg ggtttgagat accggtacca ggccaaacta 3660 
aaccgaaccc aagggggtgg cgtaggggcg tgggaggggg ggagtgcgga agccgggagg 3720 
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caggagtaag ggcgggagga gggggccgga ggagaagcag ggacgaagtc gatgacaggc 3780 
gcagtcggtg gcggcggtgg cgggtgtgcc gttgtgcagt ggctgtggag gccatgtgca 3840 
gggcggcggc ggggccgggc cgggggtggg agacttgtcc agaccccgtg gccctcttcc 3900 
agccccgtcc gccactgccg ccaccaccac cgccgccgcc gtagccacca cccctcacgt 3960 
cgaggcactt cacagatgcg aagcaaccac accgttctcc acatgaacag ctaccctccc 4020 
aaacccaact ttcccttccc gccttaccta accatgaccc gctacccccc ccccctttat 4080 
ttcttaacta accatgaatg cccccccccg gctgtacctg gctacgactt cacttcgtaa 4140 
acttaatgtg tgtaaccccc cttacacaca cacacacacc cctccccgcc cctccaaagg 4200 
ttgccaccga ctaccggctg tggctggtgg gtcaggtgga ggtgatgcgg tccgaggtgg 4260 
gcgagctgat gcgcgtggcg gcggaccgct ccgaggcaga ggtggaggtg ctcatgccgg 4320 
gtgagggggc agggaggggg ggagggggag ggggaggtgc tcatgccggt gagggtaggg 4380 
aggggagggg cagaggaggg agggggagga gggggcggct gagtgcggga gaggcaggga 4440 
tgagggcgat agaaagttgc gtattgtcgg taaactcaaa ggactagacg aagagaacaa 4500 
acctaaacaa gggagctgga gcgaggccaa atctgaacgt gacatcgccc gcctcctccc 4560 
gctgcctgct cccccacctc ctcccccatc tcgccccccc ccccacacac acacaggctt 4 620 
cacgcacctt cagaatgcca tgactgtgcg ctggagccac tggctgatga gccacgccgc 4 680 
ggcctggcag cgcgacgaca tgcggctgcg ggacctgctg ccgcgggtgg ccacactgcc 4740 
gctgggctcg ggtgggtgag ggaggggagg ggaggggagg gggggagggg gagggagagg 4800 
aggggagaag ggggggggag acgaggaggg tggaagggtg ggggcggggc ggtggaggct 4860 
agagggtggg gctgggtggg tggacggagt gcactggtag aggagggata gggtacattg 4 920 
agacgggagg agggatgcag gggcgaaggt ggggaggagg ggaggggagg aggcgtggag 4 980 
ctggagtggg ccgacgagtg tgcggacggg gcaggcggca acggggatta aacggcgggg 5040 
ggccggggcg tgtgcacgac aggggcttgc gcgtctgcga ttgtgggggc acacagggac 5100 
aggagcacga cgtgggacac gcatagatac gccgcattga caacacacac acacacacac 5160 
acacacacac acacacacac acacacacaa acacaaacac acacaaacac aaacacacac 5220 
acgccccccc ccctacacac acgccccctc cccaggcgcc ctggccggca acccctttct 5280 
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ggtggaccgc cagttcatcg ccaaggagtt 
catggacgcg gtgaggggag gaggaggggg 
gaggaggggg ggagggggtt aactttgaag 
aggagggggc ctggaggagg gggggaggag 
gagggggagg attgggaggg ggctggggga 
gagggggttg ctgctggtgt aaagggcctg 
gtactgcctg gggccccgcg ctgcagaggt 
tccccacacg cgggccgctg ccgctgcccg 
aggatataca cacacacaca cacacacaca 
cacgggcgcg ggctttcgtt tcgtttttta 
aacacactcc atctttctca cacaaacaca 
cgcgactttg tgatcgagac ggtgtttgcg 
tgggcggagg acctcatcat ctacagctcc 
gcctacgcca ccggctcctc gctcatgccg 
atcaggtgcg ggagggatgg ggtgggggtg 
tgaaggcgta gggggtggat ggggtgggtt 
tagccaggaa cagtagtaga gcagacgcgt 
gttgggtaaa gcggtacagg atgtactgag 
agcaggcagg cggggcgagg ggcgtggggg 
aacacgtgac acggagatgc accacacaaa 
agggctgaga ggcgcgtgtg gaggggtgcg 
gctgcctcca cctcctccac cgcctcctgc 
cctcctcctg ctgcaggggc aagggcggtc 
cggtgctcaa gggcacgccc accacataca 
gagggaggga gggagagcga gggagaggga 
cagggacagg gacagggaca gggacaggga 



gggtttcggc ggcggcgtgt gccccaactc 5340 
aggagggcgg gggggggcag gaggggggag 5400 
cgtaaggaaa cagtcgggag gaggggggga 54 60 
gagggtggct ggagggggct gggggaggag 5520 
gggtgcccgc agctggggga ggtggggagg 5580 
taggcactga gagcactgtg gggagccggg 5640 
gtcgcgcagt gtggcggcgc atcccccgca 5700 
ccacaccctt gccactttgt gtgctttcct 57 60 
cacacacaca cacacacaaa cacaaacaca 5820 
acacaaacac acactccccc tgtgctcctc 5880 
cacgcacaca cacatgcgca ggtgtctgac 5940 
gccagcctgc tgtgcgtgca cctgtcgcgc 6000 
ggccccttcg gctacgtgca gtgcagcgac 6060 
cagaagaaga accccgacgc cctggagctc 6120 
ggggggttac attcatggtt agttaagaag 6180 
acattcatga acatttaaga agtgaaggcg 6240 
tgtagtgtgt gggtttgggt gggagggatg 6300 
gactgcagac cgaaggagcg ggggaggggg 6360 
cgggggttac tggcaccgtg ccgggtaagc 6420 
gagggacgtg gggagtggca ggcgggggcc 6480 
gggttgggcg gggggctgtt tcatgatacc 6540 
cacctccacc tcccccactg cccctccccg 6600 
gtgtgcaggg caacctgatg ggcgtcatgg 6660 
acaaggactt ccaggcgaga gagcgagagc 6720 
gggagaggga gggagaggga gacagaggga 6780 
cagggacagg ggcaggggca ggggcagggg 6840 
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caggggcagg ggcaggggag gccccccggg 
taggggcgct gcactgaggc cgcgaggcgg 
gggagcggac atgcgccgca aacacagacg 
tgggcttact gacaattcat cattgtgcgc 
ttgtgcagga gtgttgggag ctgctgtttg 
gcatcgccac cggcgtgctg tccaccctgc 
agcgtagccg agcagggctg gagcagcagc 
agcgggagcg ggagcagcag gaggggtggt 
aggaagggaa ggaggagcag gagcaggagg 
ggggtcgtgt ccttggccgc atgggcggag 
gagcctgcac ccacgctccg cgggtcccta 
ctcttgcgcc caccccctta gtgcatcctg 
cctgacctcc tcgccaaacc tcccccgccc 
gacttggccg agtacctggt gcgcaagggc 
tgcggccggg cgggagggcg tgagggcgtg 
tcgaacgttg tgccgcgcct gtttcacaat 
catatgtata acaaactgac caccaatcat 
gcaccacgcc accacagcgg cgccgccgtg 
ttcgacctca ccgtggacga cctcaagacc 
gcggtgagcg gcggcgcgga gcagcagcag 
tagcctgggg gggagcgtgt gggaggaacg 
gcagccgagc gcacgtggtg ctttgcccca 
atgctggtgt gctgctaggt ggttcacacc 
ccttactgtg gccatgtgaa cggcatcctc 
ccacaacccc gcaccaccca gctcctcaat 
cgcaccaaca acacgacgca tgtgtttgcc 
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ggcggcgggc ccggggcatg aggtcagaca 6900 
gcgggaggca gggggcgggg ggcggggggc 6960 
ggttgagaaa gcacaacgac tggaacgcag 7020 
atatgtgtgt atgtgtatgt gtgtgtttgt 7080 
acacggtgga cacggtgcac gacgtggtgc 7140 
ggatcaagcc cgaccgcatg aaggccggtg 7200 
cgggcagcag tagcagcagg gcaggggagc 7260 
tgggaagcgg tgggggtagg gtgggagcgg 7320 
aagaggagga ggaagggcgg tggggggtgg 7380 
gcggggaggc ggggaggagg cggggaagca 7440 
ccgtcttgcg cctaaccccg tgcgcctagc 7500 
tacccctctt tccaaacatc cttgcaactc 7560 
ccaggcctgt ccgccgacat gctcgccacg 7 620 
gtgccgttcc gggagacaca ccaccacagg 7 680 
ggtggggcat gcccggggtt gtgagagcta 7740 
gtcgggccac agggtatgca gtttcctctc 7800 
gcacgctcac acgctctccc acacacacgc 7860 
aagatggccg aggaccgcgg ctgcacgctg 7 920 
atccacccgc tcttcaccga cgacgtggcg 7980 
cagcagcagc agcagcagca gcagcagcag 8040 
gcgggggagg ggaggcgggg ggtgtcgttt 8100 
ttccatgcca gcagggtgac acacctgacc 8160 
tacgtgtgaa tttgtgctgg cgtgcgcaca 8220 
atgtcctcgt gattgcgccc ggcacattgc 8280 
ccagtgcaag gaaaggaaat gcacgcccgc 8340 
acgtgcgcgc acacacgcgc aggtgtggga 84 00 
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cttcaaccgc agcgccgaga tgcgcgacac ggagggcggc accagcaagc gctcggtgct 8460 

ggagcaggtg cagaagatgc gcacctacct ggcggcggag ggacagcact gagcgggtcg 8520 

ggggaggggg ggcgggtgtg tatgtgtgtg tgtgtgcgtg tgtaagtctc ggtggagggg 8580 

tggtcctcta tatggcggcg gggccacagg gggacgggtg tgacagagtt acggccggag 8640 
ccagcggagt cccgggatgg attaaggatc 8670 

<210> 70 
<211> 745 
<212> DNA 
<213> Unknown 

<220> 

<223> Description of Unknown Organism: Unknown 
<400> 70 

atgagatggc gacgcgcccc gcgccgctcc gggcgtcccg gcccccgggc ccagcgcccc 60 
ggctccgccg cccgctcgtc gccgccgctg ccgctgctgc cactactgct gctgctgggg 120 
accgcggccc tggcgccggg ggcggcggcc- ggcaacgagg cggctcccgc gggggcctcg 180 
gtgtgctact cgtccccgcc cagcgtggga tcggtgcagg agctagctca gcgcgccgcg 240 
gtggtgatcg agggaaaggt gcacccgcag cggcggcagc agggggcact cgacaggaag 300 
gcggcggcgg cggcgggcga ggcaggggcg tggggcggcg atcgcgagcc gccagccgcg 360 
ggcccacggg cgctggggcc gcccgccgag gagccgctgc tcgccgccaa cgggaccgtg 420 
ccctcttggc ccaccgcccc ggtgcccagc gccggcgagc ccggggagga ggcgccctat 4 80 
ctggtgaagg tgcaccaggt gtgggcggtg aaagccgggg gcttgaagaa ggactcgctg 540 
ctcaccgtgc gcctggggac ctggggccac cccgccttcc cctcctgcgg gaggctcaag 600 
gaggacagca ggtacatctt cttcatggag cccgacgcca acagcaccag ccgcgcgccg 660 
gccgccttcc gagcctcttt cccccctctg gagacgggcc ggaacctcaa gaaggaggtc 720 
agccgggtgc tgtgcaagcg gtgcg 74 5 



<210> 71 
<211> 1986 
<212> DNA 
<213> Unknown 
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<220> 

<223> Description of Unknown Organism: Unknown 
<400> 71 

gaattccttt tttttttttt ttttttcttt ttttttttgc ccttatacct cttcgccttt 60 
ctgtggttcc atccacttct tccccctcct cctcccataa acaactctcc tacccctgca 120 
cccccaataa ataaataaaa ggaggagggc aaggggggag gaggaggagt ggtgctgcga 180 
ggggaaggaa aagggaggca gcgcgagaag agccgggcag agtccgaacc gacagccaga 240 
agcccgcacg cacctcgcac catgagatgg cgacgcgccc cgcgccgctc cgggcgtccc 300 
ggcccccggg cccagcgccc cggctccgcc gcccgctcgt cgccgccgct gccgctgctg 360 
ccactactgc tgctgctggg gaccgcggcc ctggcgccgg gggcggcggc cggcaacgag 420 
gcggctcccg cgggggcctc ggtgtgctac tcgtccccgc ccagcgtggg atcggtgcag 480 
gagctagctc agcgcgccgc ggtggtgatc gagggaaagg tgcacccgca gcggcggcag 540 
cagggggcac tcgacaggaa ggcggcggcg gcggcgggcg aggcaggggc gtggggcggc 600 
gatcgcgagc cgccagccgc gggcccacgg gcgctggggc cgcccgccga ggagccgctg 660 
ctcgccgcca acgggaccgt gccctcttgg cccaccgccc cggtgcccag cgccggcgag 720 
cccggggagg aggcgcccta tctggtgaag gtgcaccagg tgtgggcggt gaaagccggg 780 
ggcttgaaga aggactcgct gctcaccgtg cgcctgggga cctggggcca ccccgccttc 840 
ccctcctgcg ggaggctcaa ggaggacagc aggtacatct tcttcatgga gcccgacgcc 900 
aacagcacca gccgcgcgcc ggccgccttc cgagcctctt tcccccctct ggagacgggc 960 
cggaacctca agaaggaggt cagccgggtg ctgtgcaagc ggtgcgcctt gcctccccaa 1020 
ttgaaagaga tgaaaagcca ggaatcggct gcaggttcca aactagtcct tcggtgtgaa 1080 
accagttctg aatactcctc tctcagattc aagtggttca agaatgggaa tgaattgaat 1140 
cgaaaaaaca aaccacaaaa tatcaagata caaaaaaagc cagggaagtc agaacttcgc 1200 
attaacaaag catcactggc tgattctgga gagtatatgt gcaaagtgat cagcaaatta 1260 
ggaaatgaca gtgcctctgc caatatcacc atcgtggaat caaacgctac atctacatcc 1320 
accactggga caagccatct tgtaaaatgt gcggagaagg agaaaacttt ctgtgtgaat 1380 
ggaggggagt gcttcatggt gaaagacctt tcaaacccct cgagatactt gtgcaagtgc 1440 



WO 99/37660 



PCI7US99/01313 



-229- 



ccaaatgagt ttactggtga tcgctgccaa aactacgtaa tggccagctt ctacagtacg 1500 



tgctgcatct cccctcagat tccacctaga gctagatgtg tcttaccaga tctaatattg 1620 

actgcctctg cctgtcgcat gagaacatta acaaaagcaa ttgtattact tcctctgttc 1680 

gcgactagtt ggctctgaga tactaatagg tgtgtgaggc tccggatgtt tctggaattg 1740 

atattgaatg atgtgataca aattgatagt caatatcaag cagtgaaata tgataataaa 1800 

ggcatttcaa agtctcactt ttattgataa aataaaaatc attctactga acagtccatc 1860 

ttctttatac aatgaccaca tcctgaaaag ggtgttgcta agctgtaacc gatatgcact 1920 

tgaaatgatg gtaagttaat tttgattcag aatgtgttat ttgtcacaaa taaacataat 1980 

aaaagg 1986 



<210> 72 
<211> 2003 
<212> DNA 
<213> Unknown 

<220> 

<223> Description of Unknown Organism: Unknown 
<220> 

<221> UNSURE 
<222> (31) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (32) 

<223> May be any nucleic acid 
<400> 72 

ggaattcctt tttttttttt tttttttctt nntttttttt tgcccttata cctcttcgcc 60 
tttctgtggt tccatccact tcttccccct cctcctccca taaacaactc tcctacccct 120 
gcacccccaa taaataaata aaaggaggag ggcaaggggg gaggaggagg agtggtgctg 180 
cgaggggaag gaaaagggag gcagcgcgag aagagccggg cagagtccga accgacagcc 240 
agaagcccgc acgcacctcg caccatgaga tggcgacgcg ccccgcgccg ctccgggcgt 300 



tccactccct 



ttctgtctct gcctgaatag 



gagcatgctc 



agttggtgct gctttcttgt 1560 



cccggccccc gggcccagcg ccccggctcc gccgcccgct cgtcgccgcc gctgccgctg 360 
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ctgccactac tgctgctgct ggggaccgcg 
gaggcggctc ccgcgggggc ctcggtgtgc 
caggagctag ctcagcgcgc cgcggtggtg 
cagcaggggg cactcgacag gaaggcggcg 
ggcgatcgcg agccgccagc cgcgggccca 
ctgctcgccg ccaacgggac cgtgccctct 
gagcccgggg aggaggcgcc ctatctggtg 
gggggcttga agaaggactc gctgctcacc 
ttcccctcct gcgggaggct caaggaggac 
gccaacagca ccagccgcgc gccggccgcc 
ggccggaacc tcaagaagga ggtcagccgg 
caattgaaag agatgaaaag ccaggaatcg 
gaaaccagtt ctgaatactc ctctctcaga 
aatcgaaaaa acaaaccaca aaatatcaag 
cgcattaaca aagcatcact ggctgattct 
ttaggaaatg acagtgcctc tgccaatatc 
tccaccactg ggacaagcca tcttgtaaaa 
aatggagggg agtgcttcat ggtgaaagac 
tgcccaaatg agtttactgg tgatcgctgc 
acgtccactc cctttctgtc tctgcctgaa 
tgttgctgca tctcccctca gattccacct 
ttgactgcct ctgcctgtcg catgagaaca 
ttcgcgacta gttggctctg agatactaat 
ttgatattga atgatgtgat acaaattgat 
aaaggcattt caaagtctca cttttattga 
atcttcttta tacaatgacc acatcctgaa 



-230- 

gccctggcgc cgggggcggc ggccggcaac 420 
tactcgtccc cgcccagcgt gggatcggtg 480 
atcgagggaa aggtgcaccc gcagcggcgg 540 
gcggcggcgg gcgaggcagg ggcgtggggc 600 
cgggcgctgg ggccgcccgc cgaggagccg 660 
tggcccaccg ccccggtgcc cagcgccggc 720 
aaggtgcacc aggtgtgggc ggtgaaagcc 780 
gtgcgcctgg ggacctgggg ccaccccgcc 840 
agcaggtaca tcttcttcat ggagcccgac 900 
ttccgagcct ctttcccccc tctggagacg 960 
gtgctgtgca agcggtgcgc cttgcctccc 1020 
gctgcaggtt ccaaactagt ccttcggtgt 1080 
ttcaagtggt tcaagaatgg gaatgaattg 1140 
atacaaaaaa agccagggaa gtcagaactt 1200 
ggagagtata tgtgcaaagt gatcagcaaa 1260 
accatcgtgg aatcaaacgc tacatctaca 1320 
tgtgcggaga aggagaaaac tttctgtgtg 1380 
ctttcaaacc cctcgagata cttgtgcaag 1440 
caaaactacg taatggccag cttctacagt 1500 
taggagcatg ctcagttggt gctgctttct 1560 
agagctagat gtgtcttacc agatctaata 1620 
ttaacaaaag caattgtatt acttcctctg 1680 
aggtgtgtga ggctccggat gtttctggaa 1740 
agtcaatatc aagcagtgaa atatgataat 1800 
taaaataaaa atcattctac tgaacagtcc 1860 
aagggtgttg ctaagctgta accgatatgc 1920 
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acttgaaatg atggtaagtt aattttgatt cagaatgtgt. tatttgtcac aaataaacat 1980 
aataaaagga aaaaaaaaaa aaa 2003 



<210> 


73 


<211> 


957 


<212> 


DNA 


<213> 


Unknown 


<220> 




<223> 


Description of Unknown Organism : Unknown 


<220> 




<221> 


UNSURE 


<222> 


(809) 


<223> 


May be any nucleic acid 


<220> 




<221> 


UNSURE 


<222> 


(810) 


<223> 


May be any nucleic acid 


<220> 




<221> 


UNSURE 


<222> 


(811) 


<223> 


May be any nucleic acid 



<400> 73 

tctcgcccca actttttccc ccgcgctccg cagcagcagc agcagcagca gcagcagcag 60 
caaaatggca gacctcttca gcggactcgt gggcggcgtc gtcggcgctg ttgctgcagc 120 
agatttgcct gcggagggcg agagggcccc ccgccccgcc cccggcactg cctggacttg 180 
ctgctgcagc aaactgcaag aaggggcccg cgagctggag ggttttgtgc agcagctgag 240 
ttttgttgca gggaagctgg cctgctgcct gcgggtgggg gcggagcagc tggcgcgctg 300 
cgctgcggag gggcggctgc ccagcagcag cagcagcagc agctgctgcg cgctgctgca 360 
gctcgagaag caggacctcg agcagagcct cgaggccggc aagcagggcg cggagtgcct 420 
cttgaggagc agcaaactgg ccctcgaggc cctcctcgag ggggcccgcg ttgcagcaac 480 
gcggggtttg ctgctggtcg agagcagcaa agacacggtg ctgcgcagca ttccccacac 540 
ccaggagaag ctggcccagg cctacagttc tttcctgcgg ggctaccagg gggcagcagc 600 
ggggaggtct ctgggctacg gggcccctgc tgctgcttac ggccagcagc agcagcccag 660 
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cagctacggg gcgccccccg cctccagcca gcagccctcc ggcttcttct ggtagccctg 720 
cagcagcagc agcagcagca gcagcagcag cagcgcgggc ggcagccgcg gcggggccgg 780 
ggcgccgctg cagcaacagc agcagccgnn ncggctagcg ccgcggagca ctcgcaggga 840 
actccacagg cagcgggaga gcagcaggga cgagaagcag gtcatgtagc gcaggcagca 900 
gcgccagctg cagcagcagc agcagcagca gcagcagcag cagcagctcc tgcaccg 957 



<210> 74 
<211> 957 
<212> DNA 
<213> Unknown 

<220> 

<223> Description of Unknown Organism: Unknown 
<220> 

<221> UNSURE 
<222> (809) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (810) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (811) 

<223> May be any nucleic acid 

tctcgcccca actttttccc ccgcgctccg cagcagcagc agcagcagca gcagcagcag 60 
caaaatggca gacctcttca gcggactcgt gggcggcgtc gtcggcgctg ttgctgcagc 120 
agatttgcct gcggagggcg agagggcccc ccgccccgcc cccggcactg cctggacttg 180 
ctgctgcagc aaactgcaag aaggggcccg cgagctggag ggttttgtgc agcagctgag 240 
ttttgttgca gggaagctgg cctgctgcct gcgggtgggg gcggagcagc tggcgcgctg 300 
cgctgcggag gggcggctgc ccagcagcag cagcagcagc agctgctgcg cgctgctgca 360 
gctcgagaag caggacctcg agcagagcct cgaggccggc aagcagggcg cggagtgcct 420 
cttgaggagc agcaaactgg ccctcgaggc cctcctcgag ggggcccgcg ttgcagcaac 480 
gcggggtttg ctgctggtcg agagcagcaa agacacggtg ctgcgcagca ttccccacac 540 
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ccaggagaag 


ctggcccagg 


cctacagttc 


tttcctccgg 


ggctaccagg 


gggcay cagt- 




ggggaggtct 


ctgggctacg 


gggcccctgc tgctgcttac 


ggccagcagc 


agcagcccag 


DOU 


cagctacggg 


gcgccccccg 


cctccagcca 


gcagccctcc 


ggcttcttct 


ggtagccctg 


720 


cagcagcagc 


agcagcagca 


gcagcagcag 


cagcgcgggc 


ggcagccgcg 


gcggggccgg 


780 


ggcgccgctg 


cagcaacagc 


agcagccgnn 


ncggctagcg 


ccgcggagca 


ctcgcaggga 


840 


actccacagg 


cagcgggaga 


gcagcaggga 


cgagaagcag 


gtcatgtagc 


gcaggcagca 


900 


gcgccagctg 


cagcagcagc 


agcagcagca 


gcagcagcag 


cagcagctcc 


tgcaccg 


957 



<210> 75 
<211> 1089 
<212> DNA 
<213> Unknown 

<220> 

<223> Description of Unknown Organism: Unknown 
<220> 

<221> UNSURE 
<222> (376) 

<223> May be any nucleic acid 

<220> . 
<221> UNSURE 
<222> (377) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (847) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (848) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (849) 

<223> May be any nucleic acid 
<220> 

<221> UNSURE 
<222> (850) 

<223> May be any nucleic acid 
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<400> 75 

gaattccctc caactcttcg cgactctctc tctctcgccc caactttttc ccccgcgccc 60 
cgcagcagca gcagcagcag cagcagcaaa atggcagacc tcttcagcgg actcgtgggc 120 
ggcgtcgtcg gcgctgttgc tgcagcagat ttgcctgcgg agggcgagag ggccccccgc 180 
cccgcccccg gcactgcctg gacttgctgc tgcagcaaac tgcaagaagg ggcccgcgag 240 
ctggagggtt ttctgcagca gctgagtttt gttgcaggga agctggcctg ctgcctgcgg 300 
qtgggggcgg agcagctggc gcgctgcgct gcggaggggc ggctgcccag cagcagcagc 360 
agcagcagct gctgcnngct gctgcagctc gagaagcagg acctcgagca gagcctcgag 420 
gccggcaagc agggcgcgga gtgcctcttg aggagcagca aactggccct cgaggccctc 480 
ctcgaggggg cccgcgttgc agcaacgcgg ggtttgctgc tggtcgagag cagcaaagac 540 
acggtgctgc gcagcattcc ccacacccag gagaagctgg ctcaggccta cagttctttc 600 
ctgcggggct accagggggc agcagcgggg aggtctctgg gctacggggc ccctgctgct 660 
gcttacggcc agcagcagca gcccagcagc tacggggcgc cccccgcctc cagccagcag 720 
ccctccggct tcttctggta gccctgcagc agcagcagca gcagcagcag cagcagcagc 780 
ggcggcggca gccgcggcgg ggccggggcg ccgctgcagc aacagcagca gccgcggcgg 840 
ctagcgnnnn gagcactcgc agggaactcc acaggcagcg ggagagcagc agggacgaga 900 
agcaggtcta tgtagcgcag gcagcagcgc cagctgcagc agcagcagca gcagcagcag 960 
cagcagcagc agctcctgca ccgcagcgtt gtgtcattta ttacgttggc agctctgagg 1020 
cctcggcgca gccaacgcgc ctcaggtatc tttcagactc ttttctctaa ggtcttccag 1080 

1089 

acggaattc 



<210> 76 
<211> 1985 
<212> DNA 
<213> Unknown 

<220> 

<223> Description of Unknown Organism -.Unknown 



<400> 76 

cgccgagctt tcggcacctc tgccgggtgg taccgagcct tcccggcgcc ccctcctctc 
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ctcccaccgg cctgcccttc cccgcgggac tatcgccccc acgtttccct cagccctttt 120 
ctctcccggc cgagccgcgg cggcagcagc agcagcagca gcagcaggag gaggagcccg 180 
gtggcggcgg tggccgggga gcccatggcg tacagtcaag gaggcggcaa aaaaaaagtc 240 
tgctactact acgacggtga tattggaaat tattattatg gacagggtca tcccatgaag 300 
cctcatagaa tccgcatgac ccataacttg ctgttaaatt atggcttata cagaaaaatg 360 
gaaatatata ggccccataa agccactgcc gaagaaatga caaaatatca cagtgatgag 420 
tatatcaaat ttctacggtc aataagacca gataacatgt ctgagtatag taagcagatg 4 80 
catatattta atgttggaga agattgtcca gcgtttgatg gactctttga gttttgtcag 540 
ctctcaactg gcggttcagt tgctggagct gtgaagttaa accgacaaca gactgatatg 600 
gctgttaatt gggctggagg attacatcat gctaagaaat acgaagcatc aggattctgt 660 
tacgttaatg atattgtgct tgccatcctt gaattactaa agtatcatca gagagtctta 720 
tatattgata tagatattca tcatggtgat ggtgttgaag aagcttttta tacaacagat 780 
cgtgtaatga cggtatcatt ccataaatat ggggaatact ttcctggcac aggagacttg 840 
agggatattg gtgctggaaa aggcaaatac tatgctgtca attttccaat gtgtgatggt 900 
atagatgatg agtcatatgg gcagatattt aagcctatta tctcaaaggt gatggagatg 960 
tatcaaccta gtgctgtggt attacagtgt ggtgcagact cattatctgg tgatagactg 1020 
ggttgtttca atctaacagt caaaggtcat gctaaatgtg tagaagttgt aaaaactttt 1080 
aacttaccat tactgatgct tggaggaggt ggctacacaa tccgtaatgt tgctcgatgt 1140 
tggacatatg agactgcagt tgcccttgat tgtgagattc ccaatgagtt gccatataat 1200 
gattactttg agtattttgg accagacttc aaactgcata ttagtccttc aaacatgaca 1260 
aaccagaaca ctccagaata tatggaaaag ataaaacagc gtttgtttga aaatttgcgc 1320 
atgttacctc atgcacctgg tgtccagatg caagctattc cagaagatgc tgttcatgaa 1380 
gacagtggag atgaagatgg agaagatcca gacaagagaa tttctattcg agcatcagac 1440 
aagcggatag cttgtgatga agaattctca gattctgagg atgaaggaga aggaggtcga 1500 
agaaatgtgg ctgatcataa gaaaggagca aagaaagcta gaattgaaga agataagaaa 1560 
gaaacagagg acaaaaaaac agacgttaag gaagaagata aatccaagga caacagtggt 1620 
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gaaaaaacag ataccaaagg aaccaaatca gaacagctca gcaacccctg aatttgacag 1680 
tctcaccaat ttcagaaaat cattaaaaag aaaatattga aaggaaaatg ttttcttttt 1740 
gaagacttct ggcttcattt tatactactt tggcatggac tgtatttatt ttcaaatggg 1800 
acttttt.cgt ttttgttttt ctgggcaagt tttattgtga gattttctaa ttatgaagca 1860 
aaatttcttt tctccaccat gctttatgtg atagtattta aaattgatgt gagttattat 1920 
gtcaaaaaaa ctgatctatt aaagaagtaa ttggcctttc tgagctgaaa aaaaaaaaaa 1980 
aaaag 1985 

<210> 77 
<211> 476 
<212> DNA 
<213> Unknown 

<220> 

<223> Description of Unknown Organism: Unknown 
<400> 77 

ccaccctcct ccccctcccc cggccacttc gctaacttgg tggctgttgt gatgcgtatt 60 
cctgtagatc cgagcaccag ccggcgcttc agccccccct ccagcagcct gcagcccggc 120 
aaaatgagcg acgtgagccc ggtggtggct gcgcaacagc agcagcaaca gcagcagcag 180 
caacagcagc agcagcagca gcaacagcag cagcagcagc aggaggcggc ggcggcggct 240 
gcggcggcag cggcggctgc ggcggcggca gctgcagtgc cccggttgcg gccgccccac 300 
gacaaccgca ccatggtgga gatcatcgcc gaccacccgg ccgaactcgt ccgcaccgac 360 
agccccaact tcctgtgctc ggtgctgccc tcgcactggc gctgcaacaa gaccctgccc 420 
gtggccttca aggtaagagg ctaccccgcc ccccgccccc ggccgggagc ggcgga 47 6 



<210> 78 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : DNA Primer 



<400> 78 

gcattttgga tccgcctttt catg 



24 



WO 99737660 
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<210> 79 ' * 

<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : DNA Primer 
<400> 79 

gttgtgtgct gcagattgtt cc 



<210> 80 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: DNA Primer 
<400> 80 

gaaaaatggg gatccgaggt g 



<210> 81 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: DNA Primer 
<400> 81 

gcaggagaat tccgtccatg 



<210> 82 
<211> 5 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> UNSURE 
<222> (3) 

<223> Can be any amino acid 



<400> 82 

Trp Ser Xaa Trp Ser 
1 5 
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<210> 83 
<211> 6 
<212> PRT 

<213> Homo sapiens 
<400> 83 

Cys Ser Val Thr Cys Gly 
1 5 



<210> 84 
<211> 5 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> UNSURE 
<222> (4) 

<223> Can be any amino acid 
<400> 84 

Gly Cys Gin Xaa Arg 
1 5 



<210> 85 

<211> 733 

<212> DNA 

<213> Homo sapiens 

<400> 85 

gggatccgga gcccaaatct tctgacaaaa ctcacacatg cccaccgtgc ccagcacctg 60 
aattcgaggg tgcaccgtca gtcttcctct tccccccaaa acccaaggac accctcatga 120 
tctcccggac tcctgaggtc acatgcgtgg tggtggacgt aagccacgaa gaccctgagg 180 
tcaagttcaa ctggtacgtg gacggcgtgg aggtgcataa tgccaagaca aagccgcggg 24 0 
aggagcagta caacagcacg taccgtgtgg tcagcgtcct caccgtcctg caccaggact 300 
ggctgaatgg caaggagtac aagtgcaagg tctccaacaa agccctccca acccccatcg 360 
agaaaaccat ctccaaagcc aaagggcagc cccgagaacc acaggtgtac accctgcccc 420 
catcccggga tgagctgacc aagaaccagg tcagcctgac ctgcctggtc aaaggcttct 480 
atccaagcga catcgccgtg gagtgggaga gcaatgggca gccggagaac aactacaaga 540 
ccacgcctcc cgtgctggac tccgacggct ccttcttcct ctacagcaag ctcaccgtgg 600 
acaagagcag gtggcagcag gggaacgtct tctcatgctc cgtgatgcat gaggctctgc 660 
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acaaccacta cacgcagaag agcctctccc tgtctccggg taaatgagtg cgacggccgc 720 

733 

gactctagag gat 



<210> 86 
<211> 86 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: DNA Primer 
<400> 86 

gcgcctcgag atttccccga aatctagatt tccccgaaat gatttccccg aaatgatttc 60 



cccgaaatat ctgccatctc aattag 



86 



<210> 87- 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: DNA Primer 
<400> 87 

gcggcaagct ttttgcaaag cctaggc 27 



<210> 88 
<211> 271 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence :PCR Fragment 
<400> 88 

ctcgagattt ccccgaaatc tagatttccc cgaaatgatt tccccgaaat gatttccccg 60 
aaatatctgc catctcaatt agtcagcaac catagtcccg cccctaactc cgcccatccc 120 
gcccctaact ccgcccagtt ccgcccattc tccgccccat ggctgactaa ttttttttat 180 
ttatgcagag gccgaggccg cctcggcctc tgagctattc cagaagtagt gaggaggctt 240 
ttttggaggc ctaggctttt gcaaaaagct t 271 



<210> 89 
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-240- 
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<211> 32 

<212> DNA 

<213> Homo sapiens 

<400> 89 

gcgctcgagg gatgacagcg atagaacccc gg 



<210> 90 

<211> 31 

<212> DNA 

<213> Homo sapiens 

<400> 90 

gcgaagcttc gcgactcccc ggatccgcct c 31 



<210> 91 

<211> 12 

<212> DNA 

<213> Homo sapiens 

<400> 91 
ggggactttc cc 



<210> 92 
<211> 73 
<212> DNA 

<213> Homo sapiens 
<400> 92 

gcggcctcga ggggactttc ccggggactt tccggggact ttccgggact ttccatcctg 60 

73 

ccatctcaat tag 



<210> 93 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence :PCR Fragment 
<400> 93 

gcggcaagct ttttgcaaag cctaggc 
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INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 
(PCTRule I3bis) 


A. The indications made below relate to the microorganism referred to in the description on page 32, lines 16-17. 


R IDENTIFICATION OF DEPOSIT 


Further deposits are identified on an additional sheet G9 


Name of depository institution 

American Type Culture Collection 


,REC'D 2 6 FEB 1998] 
fWlPO PCT H 




Address of depository institution (including postal code and country) 




10801 University Boulevard formerly at: 12301 Parklawn Drive 
Manassas, Virginia 201 1 0-2209 Rockville, Maryland 20852 
United States of America United States of America 




Date of deposit 

15 January 1998 


Accession Number 

209581 


C. ADDITIONAL INDICATIONS (leave blank if not applicable) information is continued on an additional sheet □ 


DNA plasmid HOUCQ17 






In respect of those designations in which a European Patent is sought a sample of the deposited microorganism will be made available 
until the publication of the mention of the grant of the European patent or until the date on which the application has been refused or 
withdrawn or is deemed to be withdrawn, only by the issue of such a sample to an expert nominated by the person requesting the 
sample (Rule 28(4) EPC). 


D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE 0/tke indications are not for all designated States) 




E. SEPARATE FURNISHING OF INDICATIONS (leave 


blank if not applicable) 




The indications listed below will be submitted to the international Bureau later (specify the general nature of the indications, e.g., 
"Accession Number of Deposit") 





For International Bureau use only 


^This sheet was received with the international application 


□ This sheet was received by the International Bureau on: 


Authorized officer 

u^; n . i It 


Authorized officer 
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DNA plasmid HOUCQ17 



AUSTRALIA 

The applicant hereby gives notice that the furnishing of a sample of a microorganism shall only 
be effected prior to the grant of a patent, or prior to the lapsing, refusal or withdrawal of the 
application, to a person who is a skilled addressee without an interest in the invention (Regulation 
3 .25(3) of the Australian Patents Regulations). 

CANADA 

The applicant hereby requests that, until either a Canadian patent has been issued on the basis of 
the application or the application has been refused, or is abandoned and no longer subject to 
reinstatement, or is withdrawn, the furnishing of a sample of deposited biological material 
referred to in the application only be effected to an independent expert nominated by the 
Commissioner of Patents. 

DENMARK 

The applicant hereby requests that, until the application has been laid open to public inspection 
(by the Danish Patent Office), or has been finally decided upon by the Danish Patent Office 
without having been laid open to public inspection, the furnishing of a sample shall only be 
effected to an expert in the art The request to this effect shall be filed by the applicant with the 
Danish Patent Office not later than at the time when the application is made available to the public 
under Sections 22 and 33(3) of the Danish Patents Act. If such a request has been filed by the 
applicant, any request made by a third party for the furnishing of a sample shall indicate the expert 
to be used. That expert may be any person entered on a list of recognized experts drawn up by 
the Danish Patent office or any person approved by the applicant in the individual case. 

FINLAND 

The applicant hereby requests that, until the application has been laid open to public inspection 
(by the National Board of Patents and Registration), or has been finally decided upon by the 
National Board of Patents and Registration without having been laid open to public inspection, 
the furnishing of a sample shall only be effected to an expert in the art The request to this effect 
shall be filed by the applicant with the International Bureau before the expiration of 16 months 
from the priority date (preferably on the Form PCT/RO/134 reproduced in annex Z of Volume 
I of the PCT Applicant's Guide). If such a request has been filed by the applicant, any request 
made by a third party for the furnishing of a sample shall indicate the expert to be used. That 
expert may be any person entered on a list of recognized experts drawn up by the National Board 
of Patents and Registration or any person approved by the applicant in the individual case. 

ICELAND 

The applicant hereby requests that, until the application has been laid open to public inspection 
(by the Icelandic Patent Office), or has been finally decided upon by the Icelandic Patent Office 
without having been laid open to public inspection, the furnishing of a sample shall only be 
effected to an expert in the art. 
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DNA plasmid HOUCQ17 

Page2 

NETHERLANDS 

The applicant hereby requests that until the date of a grant of a Netherlands patent or until the 
date on which the application is refused or withdrawn or lapsed, the microorganism shall be made 
available as provided in Rule 3 1 F( 1 ) of the Patent Rules only by the issue of a sample to an expert 
The request to this effect must be furnished by the applicant with the Netherlands Industrial 
Property Office before the date on which the application is made available to the public under 
Section 22C or Section 25 of the Patents Act of the Kingdom of the Netherlands, whichever of 
the two dates occurs earlier. 

NORWAY 

The applicant hereby requests that, until the application has been laid open to public inspection (by 
the Norwegian Patent Office), or has been finally decided upon by the Norwegian Patent Office 
without having been laid open to public inspection, the furnishing of a sample shall only be effected 
to an expert in the art The request to this effect shall be filed by the applicant with the Norwegian 
Patent Office not later than at the time when the application is made available to the public under 
Sections 22 and 33(3) of the Norwegian Patents Act If such a request has been filed by the 
applicant, any request made by a third party for the furnishing of a sample shall indicate the expert 
to be used. That expert may be any person entered on a list of recognized experts drawn up by the 
Norwegian Patent office or any person approved by the applicant in the individual case. 

SINGAPORE 

The applicant hereby requests that the furnishing of a sample of a microorganism shall only be 
made available to an expert The request to this effect must be filed by the applicant with the 
International Bureau before the completion of the technical preparations for international 
publication of the application. 

SWEDEN 

The applicant hereby requests that, until the application has been laid open to public inspection (by 
the Swedish Patent Office), or has been finally decided upon by the Swedish Patent Office without 
having been laid open to public inspection, the furnishing of a sample shall only be effected to an 
expert in the art The request to this effect shall be filed by the applicant with the International 
Bureau before the expiration of 16 months from the priority date (preferably on the Form 
PCT/RO/1 34 reproduced in annex Z ofVolume I of the PCT Applicant's Guide). If such a request 
has been filed by the applicant, any request made by a third party for the furnishing of a sample 
shall indicate the expert to be used. That expert may be any person entered on a list of recognized 
experts drawn up by the Swedish Patent office or any person approved by the applicant in the 
individual case. 

UNITED KINGDOM 

The applicant hereby requests that the furnishing of a sample of a microorganism shall only be 
made available to an expert The request to this effect must be filed by the applicant with the 
International Bureau before the completion of the technical preparations for international 
publication of the application. 



Applicant's or agent's file 

reference number: 1488.107PC0* 
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INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 
(PCT Rule 136*) 



A. The indications made below relate to the microorganism referred to in the description on page 22, lines 25-26 . 



R. IDENTIFICATION OF DEPOSIT 



Further deposits are identified on an additional sheet □ 



Name of depository institution 

American Type Culture Collection 



; — D 2 o FEB 1SS8 
jwiPQ 



PCT 



Address of depository institution (including postal code and country) 



10801 University Boulevard 
Manassas, Virginia 201 10-2209 
United States of America 



formerly at: 



12301 Parklawn Drive 
Rockville, Maryland 20852 
United States of America 



Date of deposit 

15 January 1998 



Accession Number 

209582 



C. ADDITIONAL INDICATIONS (leave blank if not applicable) 



This information is continued on an additional sheet D 



DNAplasmid HCE4D69 

In respect of those designations in which a European Patent is sought a sample of the deposited microorganism will be made available 
until the publication of the mention of the grant of the European patent or until the date on which the application has been refused or 
withdrawn or is deemed to be withdrawn, only by the issue of such a sample to an expert nominated by the person requesting the 
sample (Rule 28(4) EPC). 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States) 



E. SEPARATE FURNISHING OF INDICATIONS (lean bkmk i/not appJicabie) 



The indications listed below will be submitted to the international Bureau later (specify the general nature of the indications, e.g., 
"Accession Number of Deposit"*) 



For receiving Office use only 



For International Bureau use only 



This sheet was received with the international application 



□ This sheet was received by the International Bureau on: 



Authorized officer 

in 



Authorized officer 



,J f> 



Form PCT/ROM34 (July 1992) 
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AUSTRALIA 

The applicant hereby gives notice that the furnishing of a sample of a microorganism shall only 
be effected prior to the grant of a patent, or prior to the lapsing, refusal or withdrawal of the 
application, to a person who is a skilled addressee without an interest in the invention (Regulation 
3.25(3) of the Australian Patents Regulations). 

CANADA 

The applicant hereby requests that, until either a Canadian patent has been issued on the basis of 
the application or the application has been refused, or is abandoned and no longer subject to 
reinstatement, or is withdrawn, the furnishing of a sample of deposited biological material 
referred to in the application only be effected to an independent expert nominated by the 
Commissioner of Patents. 

DENMARK 

The applicant hereby requests that, until the application has been laid open to public inspection 
(by the Danish Patent Office), or has been finally decided upon by the Danish Patent Office 
without having been laid open to public inspection, the furnishing of a sample shall only be 
effected to an expert in the art. The request to this effect shall be filed by the applicant with the 
Danish Patent Office not later than at the time when the application is made available to the public 
under Sections 22 and 33(3) of the Danish Patents Act. If such a request has been filed by the 
applicant, any request made by a third party for the furnishing of a sample shall indicate the expert 
to be used. That expert may be any person entered on a list of recognized experts drawn up by 
the Danish Patent office or any person approved by the applicant in the individual case. 

FINLAND 

The applicant hereby requests that, until the application has been laid open to public inspection 
(by the National Board of Patents and Registration), or has been finally decided upon by the 
National Board of Patents and Registration without having been laid open to public inspection, 
the furnishing of a sample shall only be effected to an expert in the art. The request to this effect 
shall be filed by the applicant with the International Bureau before the expiration of 16 months 
from the priority date (preferably on the Form PCT/RO/134 reproduced in annex Z of Volume 
I of the PCT Applicant's Guide). If such a request has been filed by the applicant, any request 
made by a third party for the furnishing of a sample shall indicate the expert to be used. That 
expert may be any person entered on a list of recognized experts drawn up by the National Board 
of Patents and Registration or any person approved by the applicant in the individual case. 

ICELAND 

The applicant hereby requests that, until the application has been laid open to public inspection 
(by the Icelandic Patent Office), or has been finally decided upon by the Icelandic Patent Office 
without having been laid open to public inspection, the furnishing of a sample shall only be 
effected to an expert in the art. 
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DNA plasmid HCE4D69 
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NETHERLANDS 

The applicant hereby requests that until the date of a grant of a Netherlands patent or until the 
date on which the application is refused or withdrawn or lapsed, the microorganism shall be made 
available as provided in Rule 3 1 F( 1 ) of the Patent Rules only by the issue of a sample to an expert 
The request to this effect must be furnished by the applicant with the Netherlands Industrial 
Property Office before the date on which the application is made available to the public under 
Section 22C or Section 25 of the Patents Act of the Kingdom of the Netherlands, whichever of 
the two dates occurs earlier. 

NORWAY 

The applicant hereby requests that, until the application has been laid open to public inspection (by 
the Norwegian Patent Office), or has been finally decided upon by the Norwegian Patent Office 
without having been laid open to public inspection, the furnishing of a sample shall only be effected 
to an expert in the art. The request to this effect shall be filed by the applicant with the Norwegian 
Patent Office not later than at the time when the application is made available to the public under 
Sections 22 and 33(3) of the Norwegian Patents Act. If such a request has been filed by the 
applicant, any request made by a third party for the furnishing of a sample shall indicate the expert 
to be used. That expert may be any person entered on a list of recognized experts drawn up by the 
Norwegian Patent office or any person approved by the applicant in the individual case. 

SINGAPORE 

The applicant hereby requests that the furnishing of a sample of a microorganism shall only be 
made available to an expert. The request to this effect must be filed by the applicant with the 
International Bureau before the completion of the technical preparations for international 
publication of the application. 

SWEDEN 

The applicant hereby requests that, until the application has been laid open to public inspection (by 
the Swedish Patent Office), or has been finally decided upon by the Swedish Patent Office without 
having been laid open to public inspection, the furnishing of a sample shall only be effected to an 
expert in the art. The request to this effect shall be filed by the applicant with the International 
Bureau before the expiration of 16 months from the priority date (preferably on the Form 
PCT/RO/1 34 reproduced in annex Z of Volume I of the PCT Applicant's Guide). If such a request 
has been filed by the applicant, any request made by a third party for the furnishing of a sample 
shall indicate the expert to be used. That expert may be any person entered on a list of recognized 
experts drawn up by the Swedish Patent office or any person approved by the applicant in the 
individual case. 

UNITED KINGDOM 

The applicant hereby requests that the furnishing of a sample of a microorganism shall only be 
made available to an expert The request to this effect must be filed by the applicant with the 
International Bureau before the completion of the technical preparations for international 
publication of the application. 
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